Table IE- BLAST results for NOVla 



Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
{%) 


Positives 
(%) 


Expect 


ptnr : SPTREMBL- 
ACC:Q9D398 


6330415E02RIK 
PROTEIN - Mub 
musculus (Mouse) 


945 


862/945 
(91%) 


897/945 
(94%) 


0.0 


ptnr : SPTREMBL- 
ACC:O08722 


TRANSMEMBRANE 
RECEPTOR UNC5H2 


945 


862/945 
(91%) 


893/945 
(94%) 


0.0 


ptnr : SPTREMBL- 
ACC: 008747 


UNC-5 H0M0LOG (C 
ELEGANS) 


931 


610/929 
(65%) 


723/929 
(77%) 


0.0 


ptnr -.SPTREMBL- 
ACC: 095185 


TRANSMEMBRANE 
RECEPTOR UNC5C - 
Homo sapiens 


931 


598/929 
(64%) 


718/929 
(77%) 


0.0 



The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table IF. In the ClustalW alignment of the NOV1 proteins, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence (i.e., regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 



i) 

2> 
3) 
4) 
5) 
6) 



NOVla 
NOV lb 
ptnr: 
ptnr : 
ptnr : 
ptnr : 



Table IF. ClustalW Analysis of NOV1 

(SEQ ID NO: 2) 
(SEQ ID NO:4) 

6330415E02RIK PROTEIN - Mus musculus (Mouse) (SEQ ID NO: 33) 
TRANSMEMBRANE RECEPTOR UNC5H2 (SEQ ID NO: 34) 
UNC-5 H0M0LOG (C. ELEGANS) (SEQ ID NO: 35) 

TRANSMEMBRANE RECEPTOR UNC5C - Homo sapiens (SEQ ID NO: 36) 



NOVla 

NOVXb 

Q9D398 

008732 

008747 MRKGWWTA 

09518 5 MRKCLRATA 




NOVlo 

NOVlb 

Q9D398 

008722 

008747 

09518S 

NOVla 

NOV lb 

Q9D39S 

0Q6722 

008747 

09S185 

NOVla 

NOVlb 

Q9D398 

008722 

008747 

09518S 

NOVla 



LScRAFPATQIYFKoJjtWScflDHVTQ^ 

iflr-DnenATrtT vFvrnT4~PWV5f)iTiDHVT0E§LDEATGLRVREV0 I EVSRQQVEELFGLEOYWCQCVAWSSaGTTKSRRAYjj 
'?HVTOc5lDEATCLRQrEVQ I EVSROOVEELFGLEDYWCQCVAWSSSGTTKSRRAVf 



jVREVgl EMS RQQVEE LFCj 
JvREvBl ERS RQQVEE LFGj 



GEE DY W CQCVAVJ 3 S AGTT XS RE AYV 
'GfvRDYW CQCVAW S S ACTT KS El^A Y V 



RIAYLRKTF^QEPLGKE 



PECgPVASVEWLK jEDjj I 
PEG{tPVAEVEWLK»"EcjjI 



VAKRRSTTATVTVYV^C>JS^^ 
SivAKRRETTAWIVYV.^WSSWAEWSPCS^PX'GRGWQKRTRTCTSiPAPI^GAFCEGO^QKTAC'n 



H S PCS&R CG RGWQ KRT R 

I VA KRR C TTATV I VY vSgGWS S WAEW5 PCSgR CGRGWQ KRTRTCTjPA P I ijGCAFCEGOABQKTACTTyC P VDG/ 

ivakrrstQatvivyv^wsswaewspcsSrccrgwqk^ 

, i vakrrc-ttatv ivyvSgcwsswaews pcsSrcgrgwqkktrtctSpapd^gafcsgqa^qktacttScpvdg; 



AK J I VAKRRSTTATV IVYVSGGWSSWAEWS PCS§jR CGRGWQ KKT RTCrJpA P LiJjGAFC E GQAjj Q XTACTTjvt 
AK'S I VAKRSSTTATV I VYvSgCWSuWuEV/sSoSs RCGRGHq KRT RTC1 sPAPUJjGAFCECO^ UQi'*nACTTj»! 
AK £ I VAKR^STTATVIVYvSgCWS^U^S^ 



CPVDGAW1 
ACTThCFVDGAWI 

acttBcpvdgawi 

ACTTi*CPVDGJ^I 
iACTTiSc PVDGiyNl 



146 
146 
146 
146 
160 
160 

226 
226 
226 
226 
240 
240 

306 
306 
306 
306 
320 
320 

374 
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NOVla 

MOVlb 

Q9D398 

006722 

O08747 

095185 




NOVla 
NOV Lb 
Q9D398 
008722 
008747 
095165 

NOVla 
NOVlb 
Q9D39B 
008723 
O08747 
09S1B5 

NOVla 
NOVlb 
Q9D398 
008722 
008747 
09S185 

NOVla 

NOVlb 

090398 

008722 

O08747 

095185 

NOVla 

NOVlb 

Q9D398 

008722 

008747 

095185 



OgALKCTFTLEKHSU\STELTCK{jCVROVEGEGQ I FQLHTTLA 
oSaLH CTFT LE P H S LA S TE LTCKnCVRQV EC EGO I FO LHTTLA 



OfcALH CTFT LE P H S LA S TE LTCKJICVRQV EG EGO I FQ LHTTLA ET ? AGS LC y 
QflALHCTFTLERHSLASTEf^CKScVRQVEGEGQIFOLHTTLAETPAGSLcJj 

oSalkctftlerhslasteBtckJcvrovecegqifqlhttlaetpagslcIJ 



OgALKCTFTLERHSLASTEgTCKjcVROVECEGQIFQLHTTLAETP 
C^LHCTFTLErfJSDjnjjELKCKgCVROVEGECQ I FQL»TgTggEgP' 
oSf LHCTFTLEFuS L»5* S S I Sc KgCV RQV EGEGQ I FQ LSfcTgfoE ^ V 



EGEGQI FQLHTTLAETPAGSLCQLL-SAl»GSTjrrTOLGPYAFKI PLSIRQKI 
EGEGQ 1 FQLHTTLAETPACS LC ElCSAPCStStTOLGPYAFK I PLS I RQK J 
EGEGQI FOLHTTLAETPAGSLCgLCSAPGjTflTTOLGPYAFKI PLS I R0K1 
EGEGQ I FQLHTTLAETPACS LC2LCS A PC J^QtTOLGPYAFK I PLS I R0K1 

3ipl5iro> 

IRO 



SlpllcBa^QiB* 



854 
854 
866 
866 
652 
2y 852 



C^LDAH»ySRGjDWRMLAOKLSMDRYLiIiYFATKAS PTGV 1 LDLVJtA|»gUUUOJl/V|il->\aMUt.tMU»v^»c..-ii-vM^\ i joui 
□Jjs LOA pfls RG?D W3MLAQKLSMDR Y LwY FATKAS PTGV I LDLWEAgOODDGDLiJs LASALEEMGKSEMLVaJ\TDGDC 
CSSLDApSs RG'JdWRjJlAQKLSMDRYL'uY FATKAS PTGVTLOLWEAK'QQDDGDL^SLASi'VLEEMGlCSEMLVA^ATDGDC 
UTS LDAP>JsRgSdwr1laOK.LSMDRYI>Sy FATKAS PTGV I LDLWSA^OODDGDbJs LASALEEHCKSEMLVA^nTOGOC 

css ldapoSrgiidwrmlaWkl'^JdrylSy fatkGs ptcv i l d lw e J ls j la JEle emg35e 2a~'33 

^cm»^ g r H nnOMi aifc T fflnRYt fly FATR^ PTGV I LOLWE;. ♦^DGTLS?l^^LEENG.:gg^\^gK; S3 



933 
933 
94 5 
945 
931 
931 



The presence of identifiable domains in N0V1 , as well as all other NOVX proteins, 
was determined by searches using software algorithms such as PROSITE, DOMAIN, Blocks, 
Pfam, ProDomain, and Prints, and then determining the Interpro number by crossing the 
domain match (or numbers) using the Interpro website (http:www.ebi.ac.uk/ interpro). 
DOMAIN results for NOV1 as disclosed in Tables lG-lO, were collected from the Conserved 
Domain Database (CDD) with Reverse Position Specific BLAST analyses. This BLAST 
analysis software samples domains found in the Smart and Pfam collections. For Tables 1G- 
10 and all successive DOMAIN sequence alignments, fully conserved single residues are 
indicated by black shading or by the sign (|) and "strong" semi-conserved residues are 
indicated by grey shading or by the sign (+). The "strong" group of conserved amino acid 
residues may be any one of the following groups of amino acids: STA, NEQK, NHQK, 
NDEQ, QHRK, MILV, MILF, HY, FYW. 
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Precursor (EC 3.1.3.48)) (E = 0.0). Public amino acid databases include the GenBank 
databases, SwissProt, PDB and P1R. 

NOV2c is expressed in at least Synovium/Synovial membrane, Kidney. Expression 
information was derived from the tissue sources of the sequences that were included in the 
5 derivation of the sequence of CuraGen Acc. No. CG5071 8-05. The sequence is predicted to be 
expressed in the Rattus norvegicus glomerular mesangial. because of the expression pattern of 
(GENBANK-ID: gb:GENBANK-ID:AF063249|acc:AF063249.1) a closely related Rattus 
norvegicus glomerular mesangial cell receptor protein-tyrosine phosphatase precursor 
(PTPRQ) mRNA, complete cds homolog. 
10 Homologies among each of the above NOV2 proteins will be shared by the other 

NOV2 proteins insofar as they are homologous to each other as shown below in Table 2G. 
Any reference to NOV2 is assumed to refer to all three of the NOV2 proteins in general, 
unless otherwise noted. 



1 5 Table 2G Alignment of NO V2a, b, and c 

10 20 30 40 50 60 

| .... | .... | .... | .... | .... | | .... 1 .... | ... . | .... | .... ( 

Nov2a ^j^^BSSSESS^^BS^S^i ^^SB^^^^EBSBP ^^ ^^m 58 

NOV2b - - - 1 

20 NOV2c ffl»M»ilJMMJAMi*ftM» l^ 60 

70 60 1 90 100 110 120 
| | | | | | | | | | | | 

NOV2a ^jB^^s^^^^^ ^^^^SB^^^SS^StES^s^^^^^ 

25 NOV2b ! 

nov 2c ^b^^^MBBSSMBM^^ S3BS^!^SSMSE^^^^BSE 12 ° 

130 140 150 160 170 180 

30 

Nov2b - 7rJ_7r!7 ! 

NOV2c lilMsrcfciliifc'isMCVJi^i^ 180 

^ 190 200 210 220 230 240 

NOV2a ATKK^jmk^Z^NfiSBpy^ 23 5 

HOV2b - ________ x 

NOV 2 c QPNGjM-_Jiaj-----^^ 24 0 

40 250 260 270 280 290 300 

I I I I I I i I I | | | 

NOV2a t-lli»-islj-g-IAilnt!l-l-l-^ 294 

NOV2b - X 

45 — "--^Tf - mi ana 

310 320 330 340 350 360 

| | | | | | | | | ,|....|.. . . I 

Nov2a 3Bjfcj_H33_B.^^ 354 

NOV2b _ x 

so nov2 c mMEMEMBssEmEMSsmEssmsmm-mBsmm ^ 

370 380 390 400 410 420 

I I I 1 I I .... | 1 | | | | 

Nov2a JSSS^I^BSSSS^Sa^^^^^E^^SSSS^S^SEB^^S^^^ 414 
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N0V2b -- - x 

kov^c ggggHgBg^EB3^^B53MI^Bi33ffl^M3SIEES^E33iiB 4 i* 

430 440 450 460 470 480 

. . ■ . | | I I • - • ; V — I — I — I — I ■; ♦ ♦ 1 I I 

Nov2a fl^^^^ ^^^^^^^^^§^^^^^E^ s3S^B33MBIB ^^^fflj^^B3 474 
nov2c j^^^^g^^^^^^js^^l^SSMS^SS^SnS^SS^B^^ 479 

490 S00 510 520 530 540 

— - | | | | | | | | | , — 1 

NOV2a P>sra»]iKMiFi™*»'<^!l;Ji)^ 534 

NOV2b --- - 1 

NOV2c 539 

550 560 570 560 590 600 



NOV2a gWAElflGFgRjSJHgSSASRDNlJISPG 0L@AQNFRVT^^lgESpt5^^HBa" " PVF 588 

HOV2b - - l 

NOV2C ^EHI^SV@A3J5^EGPPTV^VRTRQQVQs@r§IINYK^3s§S§L2Y^^^^YPNGK 599 

610 620 630 640 650 660 

K0V2a FHgBlifrlEDV^OSKSlFtLRTi;^ 646 

NOV2b - 1 

NOV 2 c ITggTflYAftElgTNRAFOfhTID^ 6 59 



670 680 690 700 710 720 

I 1 I I | ... ~| .... | 1 I I I | 

NOV2a bM^sMolJ&MsMWlDlJJiWW^ 708 
NOV2b - - 1 

nov2c taro ;* wi>xmi»w*>?3**\?M ^ ttf r T Wi^ifti^iti!ii!B^iif^tj 719 

730 740 750 760 770 780 

1 | | | | | | | | 

NOV2a ItAllMgflJfjiaflgiMBaMaMfla^iaL^^ 768 

NOV2b - : i 

NOV2c M»>»+*;KW:t»J:fcWff*fJi*J;Wli;ja<^ 779 

790 800 810 820 830 640 

....!....[.... |... .| | | — | — | — | | | — | 

NOV2a IdlMalllJaf^JMellfiVVJiiMIIHrJgM^ Q2 B 

NOV2b - i 

NOV2c IMMilMiMddMtI*i»K4WiWJIIM^ 937 

850 860 870 860 890 900 

•■••I--- - I - - . , I , , ■ , I I - ■ -T I j I I | I 

NOV2a M:WJ:^VJ^«tfc<i3t:>l»ftVJ»MJ^^ 886 

NOV2b - 1 

NOV2c M:wire*;»CTiWi3i3i»nia»i=ia^^ 397 

910 920 930 940 950 960 
I I 1 I I I I 1 I I I I 

Nov2a g3gaiiMBBHB535EEfl^ 946 

NOV2b 1 

KOV2c l^JK^JMJi^iMINMtllU^^^ 9S7 

970 980 990 1000 1010 1020 
| | | | | | | | |.,,,| | | 

Nov2a ^f^^^Q^^^^si^^SEES^^S^^BSS^M^ESE!^^^ 1006 

NOV2b - i 

NOV2c k'JJMiMai^MllaMilJiMa^^^ 1017 

1030 1040 1050 1060 1070 1080 

I I I I I I I i I I 1 I 

NOV2a WM^TidaMftikfafcl^ 1066 

NOV2b EH -a^^S ^^^^^^^^E 26 

NOV 2 c LlflUtl3t»il^|tflfU(flgjU^^^ '.. 1 f^%^^fTft)^^ f\- ^fffi 1077 

1090 1100 1110 1120 1130 1140 
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tun., 1*tt» 1..t7 "UV JT taP ~ 0 !« TwiP «id» M...P «iibi UfcM 



NOV2a 
NOV2b 
NOV2c 



I 



PAQPNGLVFYYVSLILQQTPRHVRPPLVTYERSI YFDNLEKYTDY ILKITPSTEKGFSD7 
PAQPNGLVFYYVSLILQQTPRHVRPPLVTYERSI YFDJsJLEKYTDYTLKITPSTEKGFSDl 
P AQ PNGLVF Y Y VSL I LQQTPRHVRPPLVTYERS I YFDNLEKYTDY I LKITPSTEKGFSD7 



1126 
86 

1137 



10 



1150 

..I... 



1160 

I I 



1170 1180 



1190 



YTAQLYIKTEEDuPETSPIINTFKNLSSTSVLLSWDPPVXPNGAIISYDLTLQGPNErJYS 
YTAQLYIKTEEDVPETSPIINTFKWLSSTSVLLSWDPPVKPMGAIISYDLTLQGPNENYS 
YTAQLY1 KTEEDVPETSPI INTFKNLSSTSVLLSWDPPVKPNGAI ISYDLTLQGPNENYS 



1200 
. | 

1186 
146 
1197 
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KOV2a 
NOV2b 
MOV2c 



1210 

» - 1 I 



1220 



1230 1240 

» • I 1 I ■ ■ ■ 



1250 

..|....| 



FITSDNYIILEELSPFTLYSFFAAARTRKGLGPSSILFFYTDESVPLAPPQNLTLINCTS 
FITSDNYII LEELSPFTLYSFFAAARTRKGLGPSSILFFYTDESVPLAPPQNLTLINCTS 
FITSDWYIILEELSPFTLYSr FAAARTRKGLGPSSILFFYTDESVPLAPPONLTLINCTS 



1260 

.| 

1246 
206 
1257 



20 



NOV2a 
NOV2b 
NOV2c 



1270 



1280 



1290 1300 



1310 



DFVWLKWSPSPLPGGIVKVYSFKIHEHETDT1 Y YKNISGFKTEAKJjVGLEPVSTYSIRVS 
DFVWLKWSPSPLPGGI VKVYSFKIHEHETDTIYYKNISGFKTEAKLVGLEPVSTYSIRVS 
DFVWLKWSPSPLPGGI VKVYSFKI HEHETDT I YYKWISGFKTEAKLVGLEPVSTYS I RVS 



1320 
.| 

1306 
266 
1317 



25 



30 



35 



40 



1340 



NOV2a | 
NOV2b 
NOV2c 



NOV2a 
NOV2b 
NOV2c 



NOV2a 
NOV2b 
NOV2C 



_L 



1350 1360 

.| | | | 



1370 



ftkvgngnofsnvvkfttqesvpdwqnmq^iatsv^qsvlvkwdppkkangiitqyiwt 
aftkvgngnqfsnvvkfttqesvpd\a:qnmqc^tswqsvlvkwdppkkangiitqymvt 
aftkvgngnqfsnwkfttqesvpdwqnmocmatswqsvlvkwdppkkangittqymvt 



1390 
..|....| 



14 00 

■ ■|.... I 



1410 1420 

■ . | | | | , 



1430 



vernstkvspqdhmytfiki^lantsyvfkvrastsagegdestchvstlpetvpsvptnt 
vernstkvspcdhmytfikllantsyvfkvrastsagegdestchvstlpetvpsvptni 
vernstkvspqdhmytfikllantsyvfkvrastsagegdestchvstlpetvpsvptni 



1440 

. I 

1426 
386 
1437 



1450 

♦ I — I 



1460 



1470 1480 



1490 



1500 



, I 



^fsdvqstsatltwirpdtilgyfqnykittqlraqkckeweseecveyqkiqylyeahl 
ftfsdvqstsatltwirpdtilgyfqnykittqiiraqkckeweseecveyqkiqylyeahl 
?\fsdvqstsatltwirpdtilgyfqnykittqlraqkckeweseecveyqkiqylveah!. 



45 



N0V2a 
N0V2b 
N0V2c 



1510 
. . 1 I 



1520 
■ . I | ■ 



1530 1540 
I .... I .... I .... | , 



1550 
. . I I 



TEETVYGLKKFRWYRFQVAASTNAGYGNASNW1STKTLPGPPDGPPEWVHWATSPFSXS 
TEETVYGLKKFRWYRFQVAASTNAGYGNASNWISTKTLPGPPDGPPENVHWATSPFSIS 
TEETVYGLKKFRWYRFQVAASTNAGYGNASNWI STKTLFGPPDGPPENVHWATSPFS IS 



1560 
. | 

1546 
506 
1557 



50 



N0V2a 
N0V2b 
NOV2c 



! 



1570 

■ .1 I 



1580 



1590 1600 

- ♦ I I I I 



1610 

.-I I 



ISWSEPAVITGPTCYLIDVKSVDNDEFNISFIKSNEENKTIEIKDLEI FTRYSWITAFT 
ISWSEPAVITGPTCYLIDVKSVDNDEFNISFIKSNEENKTIEIKDLEI FTRYSWITAFT 
ISWSEPAVITGPTCYLI DVKSVDNDEFNISFIKSNEENKTTETKDLEI FTRYSWITAFT 



1620 
. | 

1606 
566 
1617 



55 



60 



65 



70 



NOV2a 
NOV2b 
NOV2c 



NOV2a 
NOV2b 
NOV2c 



NOV2a 
NOV2b 
NOV2c 



1630 



1640 



1650 1660 

|. ... I .... I 



1670 



GNISAAYVEGKSSAEMIVTTLESAPKDPPNNMTFQKIPDEVTKFQLTgLPPSQPNGNIOV 
GNISAAWEGKSSAEMIVTTLESAPKDPPNNMTFQKIPDEVTKFQLTSLPPSQFNGNIQV 
GNISAAYVEGKSSAEyilVTTLESAPKDPPNNMTFOKIPDEVTKFQLTSLPPSOPNGNIQV 



1680 
. | 

1666 
626 
1677 



1690 
> I I 



1700 



1710 1720 
..|....|....|... 



1730 
..|....| 



YQALVYREDDPTAVQIHNLS1 IQKTNTFVI AMLEGLKGGHTYNI SV YAVNSAGAGPKVPM 
YQALVYREDDPTAVQIHNLSIIQKTNTFVIAMLEGLKGGHTYNISVYAVNSAGAGPKVPK 
YQALVYREDDPTAVQIHNLSI IQKTNTFVIAMLEGLKGGHTYNISVYAVNSAGAGPKVPK 



1740 

A 

1726 
666 
1737 



1750 



1760 



1770 1780 
, . | | | , 



1790 
. . I I 



RI TMDI KAPARPKTKPTPI YDATGKLLVTSTTITI RMP ICYYSDDKGPI KNVQVLVTETG 
RITMDIKAPARPKTKPTPIYDATGKLLVTSTTITIRMPICYYSDDHGPIKNVQVLVTETG 
R I TMDI KAPARPKTKPTPI YDATGKI..LVTSTTITI RMP ICYYSDDHGF I KNVQVLVTETC 



1800 
.| 

1786 
746 
1797 
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NOV2a 
NOV2b 
NOV2C 



I 



1810 



1820 1830 
. - I I 1 I 



1840 1850 

,,| | | |. 



'XQHDGNVTKWYDAYFNKARPYFTNEGFPNPPCTEGKTKFSGNEEI YI IGADNACMI PGNh 
AQHDGNVTKWYDAYFNKARPYFTNEGFPNPPCTEGKTKFSGNEEI YIIGADNACMI PGME 

aqhdgnvtkv:ydayfnkarpyftnegfpnppctegktkfsgnf.etyttgadnacmtpgn:-: 



1860 

i 

1846 
806 
1857 



10 



NOV 2 a 
N0V2b 
N0V2c 



I 



1870 

..|....|. 



1680 



1890 1900 

. | | | | . > . . I 



1910 



dkicngplkpkkqylfkfratnimgqftdsdysdpvktlgeglsertQei ilsvtlcils 
dkicngplkpkkqylfkfratnikcqftdsdysdpvktlgeglsertle| 
dkicngplkpkkqylfkfratnimgoftdsdysdpvktlgeglsertlei ILSVTLCILS 



1920 

. I 

1906 
855 
1917 



15 



20 



25 



30 



N0V2a 
N0V2b 
NOV2c 



N0V2a 
N0V2b 
NOV2c 



1930 
I - I I 



1940 



1950 1960 

, , | | 1 | . 



1970 
..I... 



1980 



r I llgtai fafar I rqkqkeggtys PQDAEI I DTKLKLDQLITVADLELKDERLTR 



1 1 LLGTA I KAFAR I RQKQKEGGTYS PQDAE I 1 DTKLKLDQLITVADLELKDERLTRl 



N0V2a 

N0V2b 

N0V2c RKSi; 



1990 2000 2010 



2020 
I I 



2030 



piskksflqhvselctnnnlkfoeefselpkflodlsstdadlpwnraknrfpni 



<PISKKSFLQHVEELCTNNNLKFQEEFSELPKFLQDLSSTDADLPWNRAKNRFPNI 



1962 

855 
.LSY 1977 



2040 

i 

2017 
855 
2037 



2050 2060 2070 
. . | | | | j . , , 



2080 
..|....| 



KPYNNNRVKL1 ADASVPGSDYINASYISGYLCPNEFIATQGPLPGTVGDFWRMVWETRAK 



KPYNNNRVKLIADASVPGSDYINASYISGYLCPNEFIATQGPLPGTVGDFWRMVWSTRAK 



2090 2100 

2077 
855 
2097 



35 



N0V2a 
N0V2b 
K0V2c 



2110 2120 2130 2140 

.|....|....|....|....|....|....|....|....| 



2150 

- - I I 



TLVMLTOCFEKGRIRCHOYWPEDNKPVTVFGDIVITKLMEDVOIDVJTIRDLKIERHGDC 



TLVMLTQCFEKGRIRCHQYWPEDNKPVTVFGDIVITKLMEDVQIDVJTI RDLKIERHGDCr' 



2160 
. | 

2137 
655 
2157 



40 



NOV2a 
NOV2b 
NOV2c 



2170 2180 2190 2200 

.|....|....|....|....|....|. . . J. . ..I ... 



TVRQCNFTAWPEHGVPENSAPLIHFVKLVRASRAHDTTPMIVHCSAGVGRTGVFIALDHL 



TVRQCNFTAVJPEHGVPENSAPLIHFVKLVRASRAHDTTPMI VHCSAGVGRTGVF I ALDHL 



2210 2220 
.|....|....| 

2197 
855 
2217 



45 



50 



55 



K0V2a 
NOV2b 
N0V2c 



NOV2a 



N0V2C 



2230 2240 
• - I I I I 



2250 
..|....| 



TQHINDHDFVDI YGLVAELRSERMCMVQNLAQYI FLKQCILDLLSNKGSNOPICFVNYS/ 1 



TQHINDHDFVDI YGLVAELRSERMCMVQNLAQYI FLHOCILDLLSNKCSNQPICFVNYSA 



2260 2270 2280 

.|... . | . ... | .... | .... | 

2257 
855 
2277 



2290 



2300 



LQKMDSLDAMEGgDVELEWEETTM 



2281 (SEQ ID NO:6) 
NOV2b 855 (SEQ ID NO: 8) 



LQKMDSLDAMEGflDVELEWEETTM 



2300 (SEQ ID NO:10) 



The disclosed NOV2a polypeptide has homology to the amino acid sequences shown 
in the BLASTP data listed in Table 2H. 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



N0V2A 

gi|12621078| 
gi j 125977| 
gi j 10728873 1 
gi |7290546| 
gi | 136262S| 



Bfliifl.li.fi 

jJgFHFSFLR 

MG 

-M( 

WcATRKQI 



NOV2A 


IGAS--NEP 


9i 


12621078| 


TLATNVTKP 


9i 


125977) 




qgvrH 




gi 


10728B78| 




QGVRjj 




gi 


7290546) 


oft 


FVWLffl 




gi 


1362625) 


QfiN 


FVWI^ 





NOV2A 

gi|l2621078| 
gi| 125977| 
gi | 10728878 | 
gi | 7290546 | 
gi|l36262S| 



NOV2A 

gi|l2621078| 
gi|l25977| 
gi j 10728878 | 
gi j 7290546 | 
gi 1 1362625 j 



NOV 2 A 

gi|l2621078| 

gi|125977| 

gi|l0728878| 

gi|7290546| 

gi|1362625| 



NOV 2 A 

gi|l2621078| 
gi j 125977 | 
gi|10728878| 
gi|7290546| 
gi j 1362625 | 



NOV2A 

gi|l2621078| 

gi|l2S977| 

gi|l0728878| 

gi|7290546| 

gi|l362625| 



NOV2A 

gi|l2621078| 

gi|l25977| 

gi|10728878| 

gi|7290546| 

gi|l362625| 





270 280 

|l S FffljTHLRPYTTYLPEL ^ 
[IS Ffl^HLRP YTTYLFJJ/^y' 
pSRPP|TIS: ' 

IpsrppStis: 

| PPFPAI 
fgPPFPAGj 




310 



320 



330 












P 
P 


jGPgO 




340 350 
|....|....|....| 
rWDPPTIVTGKFSYRVELYGPS 
S WDPPTI VTGKFS YRVELYGP - 

QESAN 

QESAN 

SVQT 



33JSVQT- 



360 370 
....|....|....|....|....| 
AGR I LDNS TKDLKFA F TNltT P FTMYDV 
SGRILDNSTKDLRFAFTHLTPFTMYDV 



380 



400 




36 



450 



NOV2A 

gi| 12621078| 
gi | 125977 | 
gt 1 10728878 | 
gi|7290546 | 
gi|l362625 j 



NOV2A 

gi 
gi 
gi 
gi 
gi 



12621078| 
125977| 
10728878| 
7290546| 
1362625| 



NOV2A 

gi|12621078| 

gi|l25977| 

gi|10728878) 

gi|7290546| 

gijl362625| 



NOV2A 

gi| 12621078| 
gijl25977| 
gij 10728878) 
gi|7290546| 
gi|l362625 j 



NOV2A 

gi|l2621078| 
gi|l25977| 
gij 10728878) 
gi|7290546| 
gijl362625) 



NOV2A 

gi|l2621078| 
gij 125977| 
gij 10728878) 
gij 7290546 | 
gi j 1362625 j 



NOV2A 

gi 1 12621078 | 
gij 125977 | 
gij 10728878 | 
gij 7290546 | 
gij 1362625 j 



NOV 2 A 

gi|l2621078| 
gij 125977 | 
gijl0728878| 
gij 7290546 | 




460 470 

tllSgnnH- indpHapEIwt^q 

TLli^DlsiSMPMSPglMNli^ 

QAFSEISG II " ' 

QAFSEISG---ir 

VPrMaNGB sjtfrfFfiYP&I 

VPRfeiANGn S^yHySBI 

520 



500 



490 

iSSDLtfSLATFIYNSH 





530 540 550 

|....|....|....|....|....| 
JTRNQY ITDI AAEQLTYVLIRLRRFWAETMG 
!SBHV§TRNQYMTDITAEQLSYWRRLVPFTEHTIS 

ETKMESAP - 

ETEMESAP - 



I 



560 

!■ 



\ 



570 
I 



600 



I 



FSRYTIMSSASRDNLTSPG PLSaJBnfRVT] 

VSAF^IMGEGPPTVLTVRTREQVPSSIQlgNYgNtSl 

sovBrts 
8qvBt^s| 

-GFjLJDl 
-GI 




620 

,|....| 

iilrt; 

TEL^^NR AFQMTTVD] 

"jnSpeaswnSqmvdi 

SNQPEAsWsbMVDI 

ASHV pWpFPV, 

ASe} v PgPFPV, 




650 



690 
I 



I 



700 



660 670 680 

.,[ |....|....|....|. 

|endifvrtsedepesspqdvevidvtadeirlicwsppek 
|endi fvrtpede pes spqdvqvtgvs ps elrlkws ppek 

JTP - 

JTP 

Kg - 

£G 

710 720 730 
| | | |.-..| 

pngiiiayevlyknidtlymkntsttdSili 
png 1 1 1 a yevl»yqnadtlfvknts ttd3i i s; 

-KoUkEqQG' 
_^QG , 

760 770 

....|....|... .).... 

HGNQVSSLLSVRTSESVPC 
HGNQSSSLLSVRTSETVPD 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



Si | 1362625) 



NOV2A 

gi|12621078| 
gi|l25977| 
gi j 10728878 | 
gi I 7290546 | 
gi|l362625| 



NOV2A 



9i 

gi 

9i 
9i 



12621078| 

125977| 

10728878) 

7290546) 

1362625| 



NOV2A 

gi|12621078| 
gi|l25977| 
gi | 10728878 | 
gi j 7290546 | 
gi|l362625| 



NOV2A 

gi|l2621078| 

gi|125977| 

gi|l0728878| 

gi|7290546| 

gi|l362625| 



NOV2A 

gi| 12621078) 
gi)l25977| 
gi j 10728878 | 
gi|7290S46| 
gijl362625| 



NOV2A 



gi 
gi 
gi 
gi 
gi 



12621078| 

125977| 

10728878) 

7290546| 

1362625) 



NOV2A 

gi) 12621078) 

gijl25977| 

gi|l0728878| 

gi|7290546| 

gi|l362625| 



NOV2A 

gi|12621078| 
gi|l25977| 



820 830 840 850 



900 




950 



1000 



1010 



1020 



1030 1040 1050 

I I I I I I I I I I 

SVYYRNTSGTFMQNFTLHEVTNDFDNMTVSTIIDKLTIFSYVTFWLTAST 
SVYYQNTSGTFVQNFTLLQVTKESDNVTVSARIYRLAIFSYyTFWLTAST 



1060 1070 1080 1090 1100 

— I — ■ — ■ — | — ■ — ■ — | — •JL-i-^l - 

SVGNGNKSSDI I EVYTDQDVPEGFVGNI/TYESI SSTAI NMSBvflPA 
S VGNGNKS S D 1 1 HVYTDQD I PEGPVGNLTFESI SSTAIHyspflpSg 
HIHAQEL 

— hihaqelJuje 

T 



1110 1120 1130 

| | .)....). ...|. | 

H|LILQQ$P -RHVRPPLVTYERSr 

|lnlqq|pprhmi pplvtyems| 

SPFK 





1160 




1170 1180 

!EDIPETSPl8NTFgNLSS| 
iEDVPOTPPIgNTFRNLSsI 
TPSGVPVRPTVSgKI^REPI 

38 




1150 



1200 



-a rf-y : 



4.tth. •L.-r t«p na.P »r w .~r ~t a: 5% «? «5m W» p«w 



9i|l0728B78| 
gi | 7290546 | 
gijl362625| 




1210 



1220 



1230 



1240 



10 



NOV2A 

gi|l2621078| 
gi|l25977| 
gi I 10728878 | 
gi |7290546| 
gi|l362625| 



| | 

j mgpnenysfitsd: 

j Hgphanhtfvtsg] 

^GVKDffiALKEEMLSGPQMjjKKR 
tfGVKD®ALKEEMLSGPQMgKKR 

~~ DSTR' 
USTR' 



1250 




15 



20 



25 



30 



1260 



1270 



1280 



1290 



1300 



NOV 2 A 

gi| 12621078 | 
gi j 125977 | 
gi j 10728878 | 
gi|7290546| 
gij 1362625| 



NOV 2 A 

gi|!2621078| 
gij 125977) 
gi|10728878| 
gi|7290546| 
gij 1362625| 





1310 1320 1330 1340 1350 

-|....|....|....|....|....|....|....|....| 
ETDTI Y YKN I SGFKTEAKLVGLEPVSTYS I RVSAPTKVGNGNQP 
ETDTVFYKNISGLQTDAKLEGLEPVSTYSVSVSAFTKVGNGNQY 



35 



40 



45 



50 



55 



60 



65 



NOV 2 A 

gi|l2621078| 

gijl25977| 

gi|l0728B78| 

gi|7290546| 

gi|l362625| 



N0V2A 

gi 1 12621078| 
gi jl25977| 
gijl0728878| 
gi j7290546| 
gijl362625| 



NOV2A 

gi|l2621078| 
gijl25977| 
gij 10728878 | 
gi j7290546| 
gi j 136262S | 



NOV2A 

gi | 12621078 | 
gijl25977| 
gij 10728878 | 
gi j 7290546 ] 
gijl362625| 



1360 13 70 13 80 1390 1400 

....|....|....|....|....|....|....|....|....| 
SNWKFTTQESVPDWQNMQCMATSWQSVLVKWDPPKKANGIITO , 
SNWEFTTQESVPEAVRNIECVARDWQSVSVRWDPPRKTNGI I IHYMll 



-KX- 
-KK- 



-ID 
-ID 



1410 



1420 



1430 



1440 1450 

L • •JL- - I 

chMstHpe 

CDISTgPE 

:te0 

TEE 

SLPWQR 
jSLPjjQR 




70 



NOV 2 A 



1560 
wggT^TLPGpggG 



1570 



.tSpfs 



1580 
. . I . . . 



I 



1590 



39 



1600 



J. 



t*5« kuf "hnP «r" «hu> 



gi| 12621078| 
gijl25977| 
gi j 10728B78| 
gi j 7290546 | 
gi 1 1362625 j 




NOV2A 

gi| 12621078| 
gij 12S977| 
gi | 10728878 | 
gi|7290546| 
gi|1362625| 



NOV2A 

gi | 12621078 | 
gi j 125977 | 
gij 10728878| 
gi|7290546| 
gij 1362625| 



NOV 2 A 

gi| 12621078| 
gi jl2S977| 
gi|10728878| 
gi j 7290546 j 
gijl362625| 



N0V2A 

gi| 12621078| 
gi|l25977| 
gi|l0728878| 
gi|7290546| 
gij 1362625| 



NOV2A 

gi|12621078| 
gi|l25977| 
gi|l0728878| 
gij 7290546 | 
gijl362625j 



N0V2A 

gi| 12621078 | 
gij 125977 | 
gijl0728878| 
gij7290546| 
gijl362625j 



NOV2A 

gi|l2621078| 
gijl25977| 
gij 10728878 | 
gij7290546| 
gi|l362625| 



1610 1620 1630 1640 1650 

|....|....|....|..-.|....|....|....|....|....| 
3NDEFNISPIKSNEENKTIEIKDLEIFTRYSWITAFTGNISAAYVEGK 
DDDDFNISFLKSNEENKTTEINNLEVPTRYSWITAFVGNVSRAYTDGK 



1660 

S S AEM I VTTLES AP KDPl 
SSAEVI I TTLES VPKDP| 
-VF 
-V 



1670 



1680 



1690 



1700 




1960 1970 1980 1990 2000 

40 
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N0V2A 

gi | 1262X078 | 
gi|125977| 
gi | 1072BB78 | 
gi|7290546| 
gi j 1362625 | 




I 

AgM,KDER 
jj RR JNFbTPG 
tfRRflNFyTPG 
iGYffiTPN- 
P§GY§TPN- 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



2010 



2020 



2030 



2040 



2050 



NOV2A 

gi | 12621078 | 
gi|l25977| 
gi |10728878| 
gi |7290546| 
gi|l36262S| 



NOV2A 

gi | 12621078 | 
gi |125977| 
gi|l0728878| 
gi|7290546| 
gijl362625| 



NOV2A 



91 

gi 

gi 

gi 



12621078| 
125977) 
10728878 | 
7290546| 
136262S| 



NOV2A 

gi|l2621078| 
gi|l25977| 
gi | 10728878 | 
gi|7290546| 
gi|l362625| 



NOV2A 



gi 
gi 
gi 
gi 
gi 



12621078| 

125977) 

10728B78| 

7290S46| 

1362625| 



NOV2A 

gi|l2621078| 

gi|125977| 

gi|l0728878| 

gi|7290546| 

gi|l362625| 



NOV2A 

gi | 12621078 | 
gi | 125977 | 
gi | 10728878 | 
gi|7290546| 
gi|l362625| 



3t- 

STRLLSYRKSlj 

! IS HP 

! IS HP 




2100 



2150 



2200 



2250 



2300 



23S0 



«t2* *Wt' *hr.P ir "wV - *B »t» jkJi th-iT 'lad* l» u* 



10 



15 



20 



25 



30 



35 



40 



45 



NOV2A 

gi| 12621078| 
gi|l25977| 
gi j 10728878 | 
gi | 7290546 | 
gi|136262S| 



NOV 2 A 

gi|l2621078| 
gi|l25977| 
gi 1107288781 
gi|7290546| 
gi | 1362625 | 



NOV2A 

gi|l2621078| 

gi|l25977| 

gi|l0728878| 

gi|7290S46| 

gi|l362625| 



NOV2A 

gi |12621078| 
gi|l25977| 
gij 10728878 | 
gi|7290546| 
gi|l362625| 



2360 
|....|....| 



2370 
..|... 



2380 

..|... 



2390 



|....|, 



2400 

••I 



^PC^HRNSLVl&iPYESSRVYLTPI HGIEGSDYVNASFI DGYSY 
i LPCraHKN^VHmLPYESSRVYLTPIHGIEGSDYVNASFIDGYF Y 

! EDEMEnQuQQQSjQ LATEV \ P« 

gEDE^EQQgCX3QijQ LATEVgP 



2410 

..|... 



2420 



2430 
..|... 



2440 



.|....|. 



N0V2A 




gi|l2621078| 




gi|l25977| 


IAJ 


gij 10728878) 


IAj 


gi|7290546| 


DDI 


gi j 1362625 | 


DDI 



2450 

-•I 




ILTKLKEMGREKCFQYWP 
ILTKLKEMGREKCFQYWP 

(SSASCSSS 

[SSASCSSS 

2480 2490 
-|....|....|... 




2500 

••I 




2550 



KSGE 
KSGE 
ADED 
AD EE 



I D F I GQ VHKTKEQ FGQDG P I TVHCS AGVGRS GV F I TLS I VLERM 
IDFIGQVHKTKEQFGQDGPITVHCSAGVGRSGVFITLSIVLERM 

<WY 

WY- — - 



2560 
..|....|. 



2570 



2580 

..|... 



2590 



Q YEGVLDVFQTVRI LRSQRPAMVQTEDQYHFCYRAALEYLGS FDNYTN 
Q YEGVLD VFQTVRI LRSQRPAMVQTEDQYHFC YRAALEYLGS FDNYTN 



50 



Tables 2J-2EE list the domain descriptions from DOMAIN analysis results against 
NOV2a. This indicates that the NOV2a sequence has properties similar to those of other 
proteins known to contain this domain. 



Table 2J. Domain Analysis of NOV2a 



gnl | Smart 1 smartOO 194 , PTPc, Protein tyrosine phosphatase, catalytic 
domain (SEQ ID NO: 93) 

CD-Length = 264 residues, 99.6% aligned 
Score « 318 bits (816), Expect = 2e-87 



NOV 1: 1983 KFQEEFSELPK-FLQDLSSTDADLPWNRAKNRFPNIKPYNNNRVKLIADASVPGSDYINA 2041 

-III -I ♦ Mill II II III- Mil lllllll 

Sbjct : 1 GLEEEFEKLQRLTPDDLSCTVAILPENRDKNRYKDVLPYDHTRVKL-KPPPGEGSDYINA S9 

NOV 1: 2042 S Y I S G YLCPNE F I ATQG PL PGTVGD FWRMVW ETRAKTLVMLTQC F EKGR I RCHQ YW P EDN 2101 

42 
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The homology of these sequences is shown graphically in the ClustalW analysis shown 
in Table 3D. 



Table 3D. ClustalW Analysis of NOV3 

5 1) NOV3 (SEQ ID NO: 12) 

2) ref |XP_G27243.l| (XM_027243) hypothetical protein XP_027243 [Homo sapiens) (SEQ 
ID NO: 42) 

3) gi|15076843 |gb | AAK82958 . 1 | AF2334S0_1 (AF233450) pecanex-like protein 1 (Homo 
sapiens) (SEQ ID NO:43) 

10 4) gi|6650377|gb|AAF21809.1|AF096286_l (AF096286) pecanex 1 (Mus musculus) (SEQ ID 
NO: 44) 

5) gi| 13171105|gb|AAK13590.1|AF154413_l (AF154413) pecanex (Takifugu rubripes] (SEQ 
ID N0:45) 

6) gi| 7290294 | gb | AAF4 S755 . 1 | (AE003423) pcx gene product [alt 1) (Drosophila 
15 melanogasterj (SEQ ID NO: 46) 



10 20 30 40 50 60 

....|....|....|....|....|....|....|....|....|....|....|....| 

20 NOV3 - 1 

ref |XP_027243.1 - - 1 

gi | 15076843 |gb| MGSQTLQILRQGVWAALSGGWYYDPHQATFVNALHLYLWLFLLGLPFTLYr4ALPSTMIIV 60 

gij 6650377 |gb|A 1 

gi j 13 17 1105 | gb | MGSQTLQILRQGVWASVTGGWYYDPDQNTFVNALHLYIWLFLLCFPFTLYMALQPSMVIV 60 
25 gi | 72 90 2 94 | gb j A 1 

70 80 90 100 110 120 

....|....|....|....|....|....|....|....|....|....|....|....| 

NOV3 1 

30 ref |XP_027243 .1 -- --- - 1 

gi 1 1507 6843 | gb | AVYCPVIAAVFIVLKMVNYRLHRALDAGEVVDRTANEFTDQR-TKAEQGNCSTRRKDSNG 1 19 

gi|6650377|gb|A - 1 

gi j 1317 1105 | gb | GIYCGVIAAMFLLLKTVNYRLHHALDEGEWEHQTRESKGSRGGTGGANDPVTRREDSNG 120 
gi|7290294|gb|A 1 

35 

130 140 150 160 170 180 

....|....|....|....|....|....|....|....|....|....|....|....| 

N0V3 1 

ref |XP_027243 .1 1 

40 gi | 15076843 |gb| PSDPGGGIEMSEFIREATPPVGCSSRNSYAGLDPSNQIGSGSSRLGTAATIKGDTDTAKT 179 

gij 6650377 |gb|A - - -- 1 

gi 1 13171105 |gb| LGDPGGGIEMADFIRQETPPVDCSSRNSYVG- - 151 

gi|7290294|gb|A - 1 

45 190 200 210 220 230 240 

....|....|....|....|....|....|....|....|....|....|....|....| 

NOV3 1 

ref |XP_027343.1 - 1 

gl | 1507 6843 | gb | SDDISLSLGQSSSLCKEGSEEQDLAADRKLFRLVSNDSFISIQPSLSSCGQDLPRDFSDK 239 

50 gij 6650377|gb|A - - 1 

gi jl3171105|gb| - 151 

gi |7290294|gb|A 1 

250 260 270 280 290 300 

55 ....|....|....|....|....|....|....|....|....|....|....|....| 

N0V3 1 

ref |XP027243 .1 - - 1 

gi | 15076843 |gb| VNLPSHNHHHHVDQSLSSACDTEVASLVPLHSHSYRKDHRPRGVPRTSSSAVAFPDTSLN 299 

gi|6650377|gb|A - - 1 

60 gij 13171105 | gb| - - 151 

gi|7290294|gb|A --- 1 

310 320 330 340 350 360 

....|....|....|.--.|----|....|-...|..--l---.|----l----|.---| 

65 NOV3 - -- 1 

ref |XP_027243.1 1 



\-2u w '«t«n? i«? »r •?*..»!• ■ "ir a .„ tp w thm 



gi| 15076843 | gb| DFPLYQQRRGLDPVSELESSKPLSGSKESLVBNSGLSGEFQLAGDLKINTSQPPTKSGKS 359 

gi|6650377 |gb|A - 1 

gi|13171105|gb| -- 151 

gi|7290294|gb|A --- 1 

370 380 390 400 410 420 

....|....|....|....|....|....|....|....|....|....|....|....| 

NOV3 1 

ref |XP_027243.1 - 1 

10 gi | 15076843 |gb| KPLKAEKSMDSLRSLSTRSSGSTESYCSGTDRDTNSTVSSYKSEQTSSTHIESILSEHEE 419 

gi|6650377|gb|A 1 

gi |l3171105|gb| - — MDL 154 

gi|7290294|gb|A -- - 1 

15 430 440 450 460 470 480 

....|....|....|....|....|....|....|....|....|-..|....|....| 

NOV3 1 

ref |XP_027243.1 - - 1 

gi| 15076843 |gb| SPKAGTKSGRKKECCAGPEEKNSCASDKRTSSEKIAMEASTNSGVHEAKDPTPSDEMHNQ 479 

20 gi|6650377|gb|A - 1 

gij 13171105 |gb| NQRMSSTHGRTTVAKAPG- - 172 

gi (7290294 |gb)A - 1 

490 500 510 520 530 540 

25 .... | .... | .... | .... | .... | .... | .... | .... | .... | .... | .... | .... | 

NOV3 " " 1 

rof |XP_027243.1 1 

gi 1 15076843 | gb | RGLSTSASEEANKNPHANEFTSQGDRPPGNTAENKEEKSDKSAVSVDSKVRKDVGGKQKE 53 9 

gi|6650377|gb|A --- - 1 

30 gij 13171105 | gb | 172 

gi|7290294|gb|A - - 1 

550 560 570 580 590 600 

....|....|....|....|....|....|....|....|....|....|....|....| 

35 nov3 --- 1 

ref |XP_027 243 .1 - 1 

gi|l5076843 |gb| GDVRPKSSSVIHRTASAHKSGRRRTGKKRASSFDSSRHRDYVCFRGVSGTKPHSAIFCHD 599 

gij 6650377 |gb|A - - 1 

gi| 13171105 |gb| - -S 173 

40 gi|7290294 |gb|A -- -- 1 

610 620 630 640 650 660 

....|....|....|....|....|....|....|....|....|....|....|....| 

NOV3 1 

45 ref |XP_027243 .1 - 1 

gi|l5076B43 |gb| EDSSDQSDLSRASSVQSAHQFSSDSSSSTTSHSCQSPEGRYSALKTKHTHKERGTDSEHT 659 

gi|6650377|gb|A - - 1 

gi|l3171105|gb| - 173 

gi|7290294|gb|A --- 1 

50 

670 680 690 700 710 720 

....|....|....|....|....|.---|....|....|....|....|....|....| 

NOV3 - - - " 1 

ref |XP_027243.1 - " 1 

55 gi 1 15 07 6 84 3 | gb | HKAHLVPEGTSKKRATRRTSSTNSAKTRARVLSLDSGTVACLNDSNRLMAPESI KPLTTS 719 

gi|6650377|gb|A - 1 

gi|13171105|gb| - 173 

gi|7290294|gb|A - 1 

60 730 740 750 760 770 780 

....|....|....|....|....|....|....|....|....|....|....|.--.| 

NOV3 1 

ref|XP 027243.1 1 

gi 1 1507 6843 | gb | KSDLEAKEGEVLDELSLLGRASQLETVTRSRNSLPNQVAFPEGEEQDAVSGAAQASEEAV 779 

65 gij 6650377 |gb| A - 1 

gij 13171105 | gb | EETV 177 

gi 1 7290294 | gb | A - 1 

790 800 810 820 830 840 

70 .... I .... I .... I .... I .... I .... I .... I .... I .... I .... I .... I .... I 
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ref |XP_027243.1 
gi|15076843|gb| 
gi|6650377|gb|A 
gi |l3171105|gb| 
gi|7290294|gb|A 



HOV3 

ref |XP_027243. 



gi 
gi 
gi 
gi 



15076843 |gb| 
6650377 |gb|A 
13171105|gb| 
7290294 |gb|A 
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ref |XP_027243.1 
gi|l5076843|gb| 
gi|6650377|gb|A 
gi|13171105|gb| 
gi|7290294|gb|A 
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ref |XP_027243.1 
gi|l5076843|gb| 
gi|6650377|gb|A 
gi|l3171105|gb| 
gi|7290294|gb|A 
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ref |XP_027243.1 
gi|l5076843|gb| 
gi|6650377|gb|A 
gi j 13171105 |gb | 
gi|7290294|gb|A 
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ref |XP_027243.1 
gi|l5076843|gb| 
gl|6650377|gb|A 
gi|l3171105|gb| 
gl|7290294|gb|A 
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6650377 |gb|A 
13171105|gb| 
7290294 |gb|A 
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gi|l5076843 |gb| 
gl|66S0377|gb|A 
gi|13171105|gb| 
gi|7290294|gb|A 
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J>27243.1 
15076843 |gb | 
6650377 |gb|A 
13171105|gb| 
72902 94 j gb j A 



NOV3 

ref |XP_027243.1 
gi | 15076843 | gb | 
gl|6650377 | gb | A 
gi |13171105 | gb | 
gi |7290294 |gb|A 



NOV3 
ref |XP 



027243.1 



gi 
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gi 



15076843|gb| 
6650377 |gb| A 
13171105 | gb | 
7290294 |gb| A 
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ref |XP_027243.1 
gi|l507 684 3|gb| 
gi|6650377|gb|A 
gi|l3171105|gb| 
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rof |XP_027243.1 
gi|l5076843|gb| 
gi|6650377 |gb|A 
gi|l3171105|gb| 
g±|7290294 |gb| A 



NOV3 

ref |XP_027243.1 
gi|l5076843|gb| 
gi|6650377 | gb | A 
gi|l3171105|gb| 
gi|7290294|gb| A 



NOV3 

re£|XP_027243.1 
gi|l507 684 3|gb| 
gij 6650377 j gb | A 
gijl3171105|gb| 
gi|7290294 |gb|A 



15076843 | gb | 
6650377 J gb | A 
13171105 | gb | 
7290294|gb|A 
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gl|7290294|gb|A QG- 



gi 
gi 
gi 
gi 



15076843 |gb| 
6650377 |gb| A 
13171105 | gb | 
7290294 |gb| A 



NOV3 

rof |XP_027243.1 
gi| 15076843 |gb| 
gl |6650377 |gb| A 
gij 13171105|gb| 
gi|7290294|gb|A 



NOV3 

re£|XP_027243.1 
gi|l5076843|gb| 
gl|6650377|gb|A 
gi|13171105|gb| 
gi|7290294|gb|A 



NOV3 

rof |XP_027243.1 
gl | 15 07 6843 | gb | 
gij 6650377 |gb| A 
gi|1317110S|gb| 
gi|7290294|gb|A 



NOV3 

ref |XPJ>27243.1 
gl | 1507 6843 | gb | 
gi|6650377|gb|A 
gi|l3171105|gb| 
gi|7290294|gb|A 
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NOV3 

rof |XP_027243.1 
gi | 15076843 |gb| 
gi |6650377|gb|A 
gi|l3171105|gb| 
gi|7290294|gb|A 



NOV 3 

rof |XP_027243.1 
gi|l5076843|gb| 
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KOV3 

rof |XP_027243.1 
gi| 15076843|gb| 
gij 6650377 |gb|A 
gl|13171105|gb| 
gi|7290294|gb|A 



NOV3 

rof |XP_027243.1 
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SSLVTLCYGLCVLGRRALGTASHHMSSNLESFLYGLHALFXCDFRISS0RDEWIFAOMEL 

sslvtlcyglcvlgrralgtashhmssnlesflyglhalfkgdfrissHrdewifadmel 
sslvtlcyglcvlgrralgtashhmssnlesflyglhalfkgdfrissHrdewifadmel 

SSLVTLCYGLCVLGRRALGTASHHMSSNLESFLYGLHALrKGDFRISsJjRDEWrFADMEL 
sBLVgLCfiGLCVLGRRALGTAgHgMSSNLESFLYGLHALFKGDFRISSaRDEWIFADMEL 
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NSPSLLALRHVMDDGTNEYKIIMLNRRYLSFRVIKVNKECVRGLWAGQQQELVFLRMRNF 
NSPSLLALRHVMDDGTNEYKIIMLNRRYLSFRVIKVNKECVRGLWAGQQQELVFLRNRNF 

nspsllalrhvmddgtneykiimlnrrylsfrvikvnkecvrglwagqqoelvflrnrnp! 

^PSLLAL RHVffDgGTNEYKIIMLN RRYLSFRVIKVNKECVRGLWAGOnnPT.UFT.PMPMD 1 
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ERGS IQNAKQALRMM I NSSCDQPIGYPr FVSPLTTSYSDSH'3qlk[?ILGGPI SLGNiRNr 

ERGSIQNAKQALRNMINSSCDQPIGYPrFVSPLTTSYSDSH^QLKif ILGGPISLGNIRNF 
ERGS rONAKGALRNMr\*S??rnnPTfiVDT n/CDr TTcvcncuSr\t JSti ^/-^t^t 



ERGSrQNAKQALRNMINSSCDQPIGYPIFVSPLTTSYSDSHiSQLKffllLGGPISLGNIRNtr 
ERGSlONAKOALRWMINSSCDQPIGYPrFVSPLTTSYSDSH«?QLK@ILGGPISLGNIRNF 
ERGSlQNAKQALRWMI WSSC DQPI GYPl j ^SPLTTSY ^SHBQUgSTLGGPIsBGMIRNF 
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GjjGLHPPjTSYPPTLGTSHSSHSVQSGLVRQS PARAS VASQSsHyCYSsBrHSSLRiYSTT 

gqglhppBtsypptlgtshsshsvqsglvrqs parasvasqssBycyssHrhsslrmstt 
gjJglhpp^sypptlgtshsshsvqsglvros paras vasqssBycyssbrhsslrmsit 

G^GLHPFQ TSYPPTI^ TSHSgHSVQSgLVROSPAPJ\StTlASCS5BYCYS5;lRHc;c;T.P^QTT 
GTSg3SgSVQSGLVKgjSPARASVgjSQSS§YgYSsgRHSSLR5STT 
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gl|6650377|gb|A 
gi| 13171105|gb| 
gi|7290294|gb|A 



N0V3 

rof |XP_027243.1 
gi | 15076843 |gb | 
10 gri j 6650377|gb|A 
gi|l3171105|gb| 
gi|7290294|gb|A 



15 

NOV3 

raf |XP_027243.1 
gl 15076843 |gb| 
gi 6650377|gb|A 
20 gi 13171105 |gb| 
gi 7290294|gb|A 



25 NOV3 

re£|XP_027243.1 
gi|l5076843|gb| 
gi | 6650377 | gb | A 
gi|13171105|gb| 

30 gi|7290294|gb|A 



Pecanex gene was originally discovered in Drosophila, encoding a large, membrane- 
spanning protein. The mouse bomolog was recently reported. In the absence of maternal 

35 expression of the pecanex gene, the embryo develops severe hyperneuralization similar to that 
characteristic of Notch mutant embryos. Early gastnila embryos, lacking both maternally and 
zygotically expressed activity of the neurogenic pecanex locus, are shown to contain a greater 
than wild-type number of stably determined neural precursor cells which can differentiate into 
neurons in culture. Therefore it is anticipated that this novel human pecanex will be involved 

40 in neuronal differentiation, maintenance of neuronal precursors and neurological diseases. 

The disclosed NOV3 nucleic acid of the invention encoding a Human homolog of the 
Drosophila pecanex protein includes the nucleic acid whose sequence is provided in Table 3A 
or a fragment thereof. The invention also includes a mutant or variant nucleic acid any of 
whose bases may be changed from the corresponding base shown in Table 3 A while still 

45 encoding a protein that maintains its Human homolog of the Drosophila pecanex activities and 
physiological functions, or a fragment of such a nucleic acid. The invention further includes 
nucleic acids whose sequences are complementary to those just described, including nucleic 
acid fragments that are complementary to any of the nucleic acids just described. The 
invention additionally includes nucleic acids or nucleic acid fragments, or complements 

50 thereto, whose structures include chemical modifications. Such modifications include, by way 
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EGINLSKRKELQWPDEGIRLKAGRNSWKDWSPQEGMEGHVI HRWVPCSRDPGTRSHI DK 
EGIKjSSKRKELjJWPDEGTRLKAGRNSWKDWSPQEGMEGHViJiHRWVPCSRDPgTRSHIDtJ 
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gi|136S3970|ref |XP 

009546.3) 

(XM_009546) 


serine/threo 
nine kinase 
15 [Homo 
sapiens) 


403 


369/403 
(91%) 


381/403 
(93%) 


0.0 


gi|4507275|ref |NP 0 
03591. 1| 
(NM_003600) 


serine/threo 
nine kinase 
15; 

Serine/threo 
nine protein 
kinase 15 
(Homo 
sapiens] 


4 03 


369/403 
(91%) 


380/403 
(93%) 
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gi|744641l|pir| | JC5 
974 


aurora - 
related 
kinase 1 (EC 
2.7.-.-) - 
human 


403 
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gi | 4507279 |ref|NP 0 
03149. 1| 

/ MM flAll CO \ 
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nine protein 
kinase-6; 
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-like) (Homo 
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The homology of these sequences is shown graphically in the ClustalW analysis shown 
in Table 4D. 



Table 4D. ClustalW Analysis of NOV4 

1) N0V4 (SEQ ID NO: 14) 

2) gi | 12654873 |gb| AAH01280 . 1 | AAH01280 (BC001280) serine/threonine kinase 15 (Homo 
sapiens) (SEQ ID NO: 47) 1 

3) gi|l3653970|ref|XP_009546.3| (XM_009546) serine/threonine kinase 15 (Homo 
sapiens) (SEQ ID NO: 48) 

4) gi|4507275|ref |NP_003591.1| <NM_003600> serine/threonine kinase 15; 
Serine/ threonine protein kinase 15 [Homo sapiens) (SEQ ID NO-49) 

NO-50^ 7446411 ' Pir " JC5974 aurora_related kinase 1 (EC 2.7.-..) - human {SEQ ID 
6) gi|4507279|ref | NP_003 149 . 1 1 (NMJ)031S8> serine/threonine kinase 6- 
Serine/ threonine protein kinase-6; serine/ threonine kinase 6 (aurora/IPLl -like) 
(Homo sapiens] (SEQ ID NO: 51) M 11Ke; 



N0V4 

gi | 12654873 | 
gi |13653970| 
gi |4507275| 
gi j 7446411 1 
gi |4507279| 



N0V4 

gi |12654873 | 
gi|l3653970| 
gi j 4507275 | 
gi|7446411 j 
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PVGGP^iVLVTQQFPCQHPLPSNSGQAOjSVLCP 



MDRSKENCISGPVKATAPVGGPKRVLVTQOFPCQNPLPVNSGQAQRVLCP 
MDRSKENCISGPVKATAPVGGPKRVLVTQQFPCQNPLPVNSCQAQRVLCP 

mdrskencisgpvkatapvggpkrvlvtqqQpcqnplpvnsgqaqrvlcp 

MDRSKENCISGPVKATAPVGGPKRVLVTQQFPCQMPLPVNSGQAQRVLCF 

mdrskemcisgpvkatapvggpkrvlvtqqfpcqmplpvnsgoaorvlcp 
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SNSSQRVPLQAQKLVSSHKPVQN•QKQKQLQATSVPHPj2SRPL^!^3TCaSKO 
SNSSQRVPLQAQXLVSSHKPVQN'QKQKQLQATSVPHPVSRPLNNTQKSKQ 
SNSSQP.gPLQAQKLVSSHKPVQ>3QKQKQLQATSVPHPVSRPLNNTQKSKC 
SNSSQRVPLQAQKLVSSHKPVQNQKQKQLQATSVPHPVSRPLNNTQKSKg 

smssqp.Rplqaoklvsshkpvq^qkqkqlqEtsvphpvsrpl^ntqkskq 
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gi|4507279| 1 



NOV4 



gi 

gi 
gi 



126548731 

13653970| 

4507275) 

7446411) 

4507279| 



K0V4 

gi| 12654873| 
gi|l36S3970| 
gi|4507275| 
gij 7446411| 
gi|4507279| 



N0V4 

gi|126S4873| 

gijl3653970| 

gi|4507275| 

gi|7446411| 

gi|4507279| 



NOV4 

gi | 12654873 | 
gij 13653970] 
gi|4S07275| 
gi|744641l| 
gi|4507279| 



NOV4 

gi | 12654873 | 

gi|l36S3970| 

gi|4507275| 

gi|744641l| 

gi|4507279| 



N0V4 

gi|12654873| 

gi|l3653970| 

gi|4S0727S| 

gi|7446411| 

gi|4507279| 



NSSQRVPLOAQKLVSSHKPVQNQKQKQLQATSVPHPVSRPLMKTOKSKC 
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sapemnpseelaskqkneeskkrqwaledJeigrpEgkgkfgnvyla 
plpsapennpeeelaskqkneeskkrqwaledfeigrplgkgkfgnvyla 

PLPSAPENNPEEELASKQXNEESKKRQWALEDFEIGRPLGKGKFGNVYLA 
PLPSAPEMNPEEELASKQKNEESKKRQWALEDFEIGRPLGKGXFGNVYLA 
PLPSAPE NNPEEELASKQ K NEESK KRQWALEDFEIGRPLGKGKFGMVYLA 
PL?$ii^r^t<ij;|;l^Ut?^Ka^>^»:K^^ALEDFEIGRPLGKGKFGr^YLA 
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REKQSKFILALjgVLFKAOLEKAGVEHOLRREVEIOSHLgHPMIURLYGYF 
REKQSKFILALKVLFKAQLEKAGVEHQLRREVE10SHLRHPNI1RLYGYF 
REKQSKF1LALKVLFKAQLEKAGVEHQLRREVEIQSHLRHPNILRLYGYF 
REKQSKF1LALKVLFKAQLEKAGVEHQLRREVEIQSHLRHPNILRLYGYF 

rekqskSilalkvlfkaqlekagveholrreveioshlrhpnilrlygyf 
rekqskftlalkvlficaqlekagveholrrevetoshlrhpnilrlygyf 
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HDATRVYLILEYoPL[»TVjJ|jELQKLSKFDEORTATYITEL,AgALSYCHSK 
HDATRVYLILEYAPLCTVYRELQKLSXFDEQRTATYI7ELANALSYCHSK 
HDATRVYLILEYAPLGTVYRELQKLSKFDEQRTATYITELANALSYCHSK 
KDATRVYLILEYAPLGTVYRELQKLSKFDEQRTATYITELANALSYCHSK 
HDATRVYLILEYAPLCTVYRELQKLSKFDEQRTA TYITELA NALSYCHSK 
HDATRVYLILEYAPLGTVYRELOKLSKFDEORTA<3i5553ijANALSYCHSK 
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270 
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290 
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IHRDIKPENLLLGSAGEL^IA'JjFGWSgHAPSSKRTTLCGTLDYLPPEK 
RVIHRDIKPENLLLGSAGELKIADFGWSVHAPSSRRTTLCGTLDYLPPEM 
RVIHRDIKPENLLLGSAGELKIADFGWSVflAPSSRRTTLCGTLDYLPPEW 
RVIHRDIKPENLLLGSAGEIjKIADFGWSVHAPSSRRTTLCGTLDYLPPEK 
RVIHRDIKPENLLLGSAGELKIADFGWSVHAPSSRRTTLCGTLDYLPPEM 
RVTHRDIKPENLLLGSAGELKIADFGWSVHAPSSRRTTLCGTLDYLPPE.v 
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340 
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350 
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iegrmhdekvdlwslgvlcgeflvgkppfeantyqetykrisrveftfpd 
iegrmhdekvdlwslgvlcyeflvgkppfeantyqetykrisrveftfpd 
iegrmhdekvdlwslgvlcyeflvgkppfeantyqetykrisrveftfpd, 
iegrmhdekvdlwslgvlcyeflvgkppfeantyqetykrisrveftfpd 
iegrmhdekvdlwslgvlcyeflvgkppfeantyqetykrisrveftffd! 
iegrmhdekvdlwslgvlcyeflvgkppfeantyoetykrisrveftfpd, 
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380 
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400 
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NOV4 

gi | 12654873 | 
gi |l3653970| 
gij 4507275 | 
gi |7446411| 
gi|4507279| 



FVTEGARDLISRLLKK^PSQRPMLKEVLEyPWITANSSKPSNCQNKESuS 
FVTEGARDLISRLLKHNPSQRPMLREVLEHPWITAMSSKPSNCQNKESAS 
FVTEGARDLISRLLKHNPSQRPMLREVLEHPWrTANSSKPSNCQNKESAS 
FVTEGARDLTSRL.LKHNPSQRPMLREVLEHPWITANSSKPSNCQNKESAS 
FVTEGARDLISRLLKHNPSQRPMLREVLEHPWrTANSSKPSNCQNKESAS 
FVTEGARDLISRLLKHNPSQRPMLREVLEHPWTTATJSSK PSNCQNKESAS 



65 
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Table 5B. NOV5 protein sequence (SEQ ID NO:16) 

MGQSQGD^ 

VEEFIRNQEQIKLLEEKQEEGRSKVDDLRGTPMSVGNLEEIIDDNHAIVSTSVGSEHYDSIISFVEKDLLEPGC 
S I LLRHKVHA V IG VLMDDTG PLVTMM KVE KA PQE TYVNTGGLDNQ I QE I KES MEL PL PH PE YY EEMGTKP P KGV 
ILCGPPGTGKTLLAKAVANQTSATFLRWGSELIQKYLGDGPKLVRQVFQVAEEHAPSIMFTDEIBAIGTKRYD 
SNSGGEREIQQTMLELELLNQLGGFDSREDVKVIMATKQVETLDPVLIRPGRIDKKIEFHLPDEKTKKHIFQIH 
TS RMTLANDVTLDDL I MA KDDFS GAD I KA I CTEAGLMAL R EHRMKATN ED FKKS I ES VL Y KKH EG I PEGL Y L 



The full amino acid sequence of the protein of the invention was found to have 383 of 
442 amino acid residues (86%) identical to, and 405 of 442 amino acid residues (91%) similar 
to, the 440 amino acid residue ptnr:SWISSPROT-ACC:P49014 protein from Mus musculus 
5 (Mouse), and Rattus norvegicus (Rat) (26S Protease Regulatory Subunit 4 (P26S4) (E = 1 .7e" 

200j 

NOV5 also has homology to the amino acid sequences shown in the BLASTP data 
listed in Table SC. 



Table 5C. BLAST results for NOV5 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi|4506207|ref |NP 0 

02793.1) 

(NM_002802) 


proteasome 
(prosome , 
rnacropain) 26S 
subuni t , AT Pa se , 
1; Proteasome 26S 
subunit, ATPase, 1 
(Homo sapiens] 


440 


382/442 
(86%) 


405/442 
(91%) 


0.0 


gi|6679S0l|ref |NP 0 
32973. 1| 
(NMJ)08947) 


protease (prosome, 
rnacropain) 26S 
subunit, ATPase 1 
(Mus musculus] 


440 


383/442 
(86%) 


405/442 
(90%) 


0.0 


gi|34S717|pir| |A444 
68 


2 6S proteasome 
regulatory chain 4 
(validated] - 
human 


440 


381/442 
(86%) 


404/442 
(91%) 


0.0 


gi|l6741033|gb|AAHl 

6368.1|AAH16368 

(BC016368) 


protease (prosome, 
rnacropain) 26S 
subunit, ATPase 1 
(Homo sapiens] 


440 


382/442 
(86%) 


404/442 
(90%) 


0.0 


gi|2492S16|Bp|Q9073 
2 | PRS4_CHICK 


26S PROTEASE 
REGULATORY SUBUNIT 
4 (P26S4) 


440 


378/442 
(85%) 


402/442 
(90%) 


0.0 



10 

The homology of these sequences is shown graphically in the ClustalW analysis shown 
in Table 5D. 

Table 5D ClustalW Analysis of NOV5 

1) N0V5 (SEQ ID NO: 16) 
15 2) gi|4506207|ref |NP_002793 . l| (NM_002802) proteasome (prosome, rnacropain) 26S 

subunit, ATPase, 1; Proteasome 26S subunit, ATPase, 1 [Homo sapiens] (SEQ ID NO: 52) 

3) gi|667950l|ref |NP_032973.1| (NM_0OB947) protease (prosome. rnacropain) 26S 
subunit, ATPase 1 [Mus musculus] (SEQ ID NO: 53) 

4) gi| 345717|pir | |A44468 26S proteasome regulatory chain 4 (validated] - human (SEQ 
20 ID NO: 54) 

71 
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5) gi | 16741033 | gb | AAH16368 . 1 | AAH16368 (BC016368) protease (prosome, macropain) 26S 
subunit, ATPase 1 [Homo sapiens] (SEQ ID NO: 55) 

6) gi| 2492516 |sp|Q90732|PRS4_CHICK 26S PROTEASE REGULATORY SUBUNIT 4 (P26S4) (SEQ 
ID NO: 56) 



10 



15 



20 



NOV5 

gi |4506207| 
gi|667950l| 
gi|345717| 
gi j 16741033 | 
gi|2492S16| 



NOV5 

gi 
gi 
gi 
gi 
gi 



4506207| 
667950l| 
34S717| 
16741033| 
2492516) 



10 20 30 40 50 
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MGQSO^GHGPf^GKKDjaKDKKgKYEPjJVPTRVg^Kg^KTKC-gDflASKLF 
MGQSQSGGHGPGGGKKDDgDKKKKYEPPVPTRVGKKKKKTKGPDAASKLP 
MGQSQSGGHGPGGGKKDDKDKKKKYEPPVPTRVGKKKKK7KGPDAASKLF 

mgqsqsgghgpgggkkddgdkkkkyeppvptrvgkkkkk(2kgpdaasklp 
mcqsqsgghgpgggkkddkdkkkkyeppvptrvgkkkkk7kgpdaasklp 
mgosqsgghgpgcgkkddkdkkkkyeppvptrvgkkkkktkgpdaaskEf 
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liVTjHTQCRLKLLKiERIKDYLLMjJjEEFIRNQEQSKjjLEEKQSEgRSKVC 
LVTPHTQCRLKLLKLERIKDYLLMEEEFIRNQEQMKPLEEKQ3EERSKVC 
LVTPHTQCRLK^LKLERIKDYLLMEEEFIRNQEQMKPLEEKQEEERSKVC 
LVTPHTQCRLKLLKLERIKDYLLMEEEFIRNQEOMKPLEEKQEEERSKVC 
LVTPHTQCRLKLLKLERIKDYLLMEEEFIRNQEQMKPLEEKQEEERSKVC 
LVTPHTQCRLKLLKLERIKDYLLMEEEFIRNOEQMKPLEEKOEEERSKVC 



25 



30 



NOV5 

gi|4506207| 
gi j 6679501 | 
gi j 345717 | 
gi | 16741033 | 
gi I 2492516 | 



.1 
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130 
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140 



150 



DLRGTPMSVGTLEEI IDDNHAI VSTSVGSEHYgSl ySKV^KDLLEPGCS 
DLRGTPMSVGTLEEI I DDNHAIVSTSVGSEH YVSILS FVDKDLLEPGCSV 
DLRGTPMSVGTLEEI IDDNHAIVSTSVGSEHYVSILSFVDKDLLEPGCSV 
DLRGTPMSVGTLEEI I DDNHAIVSTSVGSEH YVSILS FVDKDLLEPGCSV 
DLRGTPMSVGTLEEI IDDN^IVSTSVGSEHYVSILS FVDKDLLEPGCSV 
DLRGTPMSVGTLEEI I DDNHAIVSTSVGSEH YVST LSFVDKDLLEPGCSV 



35 



40 



NOVS 

gi|4506207| 
gi |6679S0l| 
gi (345717) 
gijl6741033| 
gi |2492516| 
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160 



I 



170 



180 



190 
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LLgHKVHAVIGVLMDDTgPLVTjjMKVEKAPQETYiSiQGGLDNQIQEIKES 
LLNHKVHAVIGVLMDDTDPLVTVMKVEKAPQETYADIGGLDNQIQEIKES 
LLNHKVHAVIGVLMDDTDPLVTVMKVEKAPQETYADIGGLDNQIQEIKES 

llnh kvhavigvlmddtdplvtvmkvekapqetyadi ggldnqi qeikes 
llnhkvhavigvlmddtdplvtvmkvekapqetyadiggldnqiqeikes 
llnhkvhavigvlmddtdplvtvmk2ekapqetyadiggldnqiqeikes 
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NOVS 

gi|4506207| 
gi j 6679501 j 
gi|345717| 
gi |16741033 | 
gi|2492S16| 



NOV5 

gi|4506207| 

gi|6679S0lj 

gi|34S717| 

gi|16741033| 

gi|2492516| 



NOV5 

gi|4506207| 
gij 6679501 j 
gi|345717| 
gij 16741033 | 
gi|2492516| 
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220 
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230 



240 



I 



250 
I 



^elplqhpeyyeemgokppkgvilggppgtgktllakavanqtsatflrv 
velplthpeyyeemgikppkgvilygppgtgktllakavanqtsatflrv 
velplthpeyyeemgikppkgvilycppctgktllakavanqtsatflrv 
velplthpeyyeemgikppkgvilygppgtgktllakavanqtsatflrv 
velplthpeyyeemgikppkgvilygppgtgktllakavanqtsatflrv 
velplthpeyyeemgikppkgvilygppgtgktllakavanqtsatflr 
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VGSELIQKYLGDGPKLVRg2jF£VAEEHAPSIjjFuDEI^IGTKRYDSN T SG 
VGSELIQKYLGDGPKLVRELFRVAEEHAPSIVFIDEIDAIGTXRYDSNSG 
VGSELIQKY LGDGPKLVRELFRVAEEHAPSIVFIDEIDAIGTKRYDSNSG 
VGSELIQKYLGDGPKLVRELFRVAEEHAPSIVFIDEIDAIGTKRYDSNSC 
VGSELIQKYLGDGPKLVRELFRVAEEHAPSIVFIDEIDAIGTKRYDSNSG 

gseliqkylgdgpklvrelfrvaeehSpsivfideidaigtkrydsnsg 
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^KIEFgLPDEKTKKjJlIFQIHTSRMTLAjjDVTLDDLIjMAKDDSSGADIKAl 
RKIEFPLPDEKTKKRIFQIHTSRMTLADDVTLDDLIMAKDDLSGADIKAI 
RKIEFPLPDEKTKKRIFQIHTSRMTLADDVTLDDLIMAKDDLSGADIKAI 
RKI EFPLPDEKTKKRI FQIHTSRMTLADDVTLDDLI MAKDDLSGADI KA I 
RKIEFPLPDEKTKKRIFQIHTSRMTLADDVTLDDLIMAKDDLSGADIKAI 
RKIEFPLPDEKTKKRtFQIHTSRMTLADDVTLDgLTMAKDDLSGADIKA] 



10 



15 



NOV 5 

gi| 4506207 | 
gi j 6679501 j 
gi|345717| 
gi|16741033| 
gij 2492516| 



NOV5 

gi|4506207| 
gi|6679S0l| 
gi|345717| 
gi|l6741033| 
gij 2492516 | 



J. 



410 



1 



420 



430 



440 



CTEAGLMALREjJjRMKgTNEDFKKSgEgVLYKKjgEGgPEGLYL 
CTEAGLMALRERRMKVTNEDFKKSKEMVLYKKQEG7PEGLYL 
CTEAGLMALRERRMKVTNEDFKKSKENVLYKKQEGTPEGLYL 
CTEAGLMALRERRiXKVTNEDFKKSKEHVLYKKQEGTPEGLYL 

cteaglmalrerrmkvtnedfkksker-a/lykxqegtpeglyl 
cteaglmalrerrmkvtnedfkkskenSlykkmegtpeglyl 



20 



Tables 5E-F list the domain description from DOMAIN analysis results against NOV5. 
This indicates that the NOV5 sequence has properties similar to those of other proteins known 
to contain this domain. 



Table 5E. Domain Analysis of NOV5 

gnl | Pfam i pfam00004 , AAA, ATPase family associated with various 
cellular activities (AAA) . AAA family proteins often perform 
chaperone-like functions that assist in the assembly, operation, or 
disassembly of protein complexes (SEQ ID NO: 101) 
CD-Length = 186 residues, 100.0% aligned 
Score «• 190 bits (483), Expect » le-49 





NOV 4: 


221 


25 


Sbjct : 


1 




NOV 4: 


281 




Sbjct: 


61 


30 






NOV 4: 


341 




Sbjct: 


117 


35 


NOV 4: 


401 




Sbjct: 


177 



GVILCGPPGTGKTLLAKAVANQTSATFLRWGSELIQKYLGDGPKLVRQVFQVAEEHAPS 280 

I II t! Ml III! I II ♦ |+ + Mil* M + I+ 1 1 1 1 +1 +1 + 1 1 

GI LL YG P PGTGKTLLAKAVAKELGVP F I E I SGS ELLS KYVGES E KLVRAL FS LARKS APC 60 

IMFTDEIEAIGTKRYDSNSGGEREIQOTMLELELLNQLGGFDSREDVTCVIMATKQVETLD 340 

M MM* II I +1 ♦ +11 11+ +1 IMI + ♦ II 

IIFIDEIDALAPKRGDVGTGDVSS R WNQLLTEMDGFE KLSNVI V I GATNR P DLLD 116 

P VL I R PGR I D KK I E FHLPDE KTKKH I FQ I HTS RMTLANDVTLDDL I MA KD D FS GAD I KA I 400 

I Mill I++II + * I +11 + I II 1 + 

PALLRPGRFDRRIEVPLPDEEERLEILKIHLKKKPLEKDVDLDEIARRTPGFSGADLAAL 176 



I II I hi 
CREAALRAIR 186 



Table 5F. Domain Analysis of NOV5 

gnl | Smart | smart00382 , AAA, ATPases associated with a variety of 
cellular activities; AAA. This profile/alignment only detects a 
fraction of this vast family. The poorly conserved N-terminal helix is 
missing from the alignment. (SEQ ID NO: 102) 
CD-Length = 151 residues, 100.0% aligned 
Score = 61.6 bits (148), Expect = 9e-ll 



40 NOV 4: 218 PP KGV I LCGP PGTGKT LLAKA VANQTSAT FL RW GSELIQK 258 

I + I" llll + lll ♦ I* I 

Sbjct: 1 PGEIVVljIVGPPGSGKTTIJlRALARELGPDGGGVIYIDGEDLREEALl#QLLRLLVLVGEDK 60 



73 
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The homology of these sequences is shown graphically in the ClustalW analysis shown 
in Table 6D. 



Table 6D Clustal W Sequence Alignment 
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1) NOV6 (SEQ ID NO: 18) 

2) 9 i|3077703|dbj|BAA2S784.l| (AB004816) mit8U9umin29 (Oryccolagua cuniculusj (SEQ 
ID NO: 57 J 

3) gi| 6678874 |ref | NPJ)32622 . 1 | <NM_00B596) mitsugumin 29 [Mus musculus) (SEQ ID 
NO: 58) 

4) gi| 12836843 | dbj | BAB2383 1 . 1 | (AK005132) putative (Mus musculus] (SEQIDNOS9) 

5) gi| 1351168 |sp|P20488|SYPH_BOVIN SYNAPTOPHYSIN {MAJOR SYNAPTIC VESICLE PROTEIN 
P38) (SEQ ID NO:60> 

6) gi|2134413|pir| |I50720 synaptophysin Ha - chicken (SEQ ID NO:61) 
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NOV6 

gi | 3077703 | 
gi | 6678874 | 
gijl2836843| 
gi|l351168| 
gi|2134413| 



NOV6 

gi|30777O3| 
gi | 6678874 j 
gij 12836843) 
gi |1351168| 
gi|2134413| 



NOV6 

gi | 3077703 | 
gij 6678874 | 
gi | 12836843 | 
gi|l35116B| 
gi (2134413 j 



N0V6 

gi | 3077703 | 
gi|6678874| 
gij 12836843 | 
gij 1351168 | 
gi|2134413| 



NOV6 



9i 
9i 

gi 
gi 



3077703| 
667B874| 
12836B43| 
1351168| 



|....|....| 
MSSTESAGRTADKSPRQi 
MSSTESPSRAADKSPRQ< 
MSSTESPGRTSDKSPRQi 

-MDpSsjy 



NOV6 




DF 


gi|3077703| 




DF 


gi|6678874 j 




DF 


gij 12836843) 




DF 


gijl351168| 




DF 


gi|2134413 j 


m 


DF 




YGSSjSYS QQAN L 

YGP^gS YQPDYGQPAS SGGGGY 

78 



gi|2134413 I 



N0V6 

gi | 3077703 | 
gi | 6678874 j 
gi j 12836843 | 
gi|l351168| 
gi|2134413 j 



-SSGYNE3--GVN-gjDgYGPAgGYN QPGS- 



310 



320 



GPTSDEFGQQP SGP' 

G PQG - D YGQQG YGPQGA P' 
GQVG - DYGQPQS YGQSG1 




Table 6E lists the domain description from DOMAIN analysis results against NOV6. 
This indicates that the NOV6 sequence has properties similar to those of other proteins known 
to contain this domain. 



Table 6E. Domain Analysis of NOV6 

gnl|Pfam|pfam01284, Synaptophysin, Synaptophysin / synaptoporin. (SEQ 

ID NO : 1 03 ) 

CD-Length • 298 residues, 70.8% aligned 
Score « 244 bits (622), Expect » 6e-66 



Sbjct: 3 
NOV S: 89 



NOV 5: 29 RRLEEPLGFIKVLQWLFAIFAFGSCGSYSGETGAMVRCNNEAKDVSSIIVAFGYPFRLHR 88 

♦ HIMIIIMIIIII +11 1 1 1 1 Ilk. *| +| | nun 

MVIFAPl^l^KVLQWVFAIFAFATCGGYSGELQLSVDCANKTESDLNIDIAFAYPFRLHB 62 

IQYEMPLCDEESSSKTMHLMGDFSAPAEFFVTLGIFSFFYTMAALVIYLRFHNLYTENKR 148 
+ +1 I I I I + 1*1 I I* I I I I I |+ + l I — HI | + I I I II + 
Sbjct : 63 VTFEAPTC-EGDEKKNIALVGDSSSSAEFFVTVAVFAFLYSIJVA1JVTYIFFQNKYRENNK 121 

NOV S: 149 FPLVDFCVTVSFTFFWLVAAAAWGKGLTDVKGATRPSSLTAAJMSVCHGEEAVCSAGATPS 208 

IMI I I I III ++M IIMII II I - I || | | 

Sb^ct : 122 GPLIDFIATAVFAFLWLVGSSAWAKGLSDVKMATDPEEIIKGMHACHQPGNKCKELHDPV 181 

NOV 5: 209 MGLANISVLFGFINFFLWAGNCWFVFKETPWH 240 

CK . I I H*llhll Mill lllllll I 

Sbjct: 182 MSGLNTSWFGFliNFI LWAGNI WFVFKETGWA 213 

In skeletal muscle, excitation-contraction (E-C) coupling requires the conversion of the 
depolarization signal of the invaginated surface membrane, namely the transverse (T-) tubule, 
to Ca2+ release from the sarcoplasmic reticulum (SR) (Takeshima H et al., Biochem J 1998 
Apr 1;33 1 ( Pt l):317-22 / PMID: 9512495, Ul: 98180964). Signal transduction occurs at the 
junctional complex between the T-tubule and SR, designated as the triad junction, which 
contains two components essential for E-C coupling, namely the dihydropyridine receptor as 
the T-tubuJar voltage sensor and the ryanodine receptor as the SR Ca2+-release channel. 
However, functional expression of the two receptors seemed to constitute neither the signal- 
transduction system nor the junction between the surface and intracellular membranes in 
cultured cells, suggesting that some as-yet-unidentified molecules participate in both the 
machinery. In addition, the molecular basis of the formation of the triad junction is totally 
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Table 7B. Encoded NOV7 protein sequence (SEQ ID NO:20). 

LTGREVLTP FPGLGTAAAP AQGG AHL KQCDL LKLSRRQKQLCR R E PGIiAETL RDAAHLGLLECQFQ FRHER WNCS 
LEGRMGLLKRGFKETAFLYAVSSAALTHTLARACSAGRMERCTCDDSPGLESRQAWQWGVCGDNLKYSTKFLSNF 
LGSKRGNKDLRARADAHimiVGIKAVKSGLRTTCKCHGVSGSCAVRTCWKQLSPFRETGQVLKLRYDSAVKVSSA 
TNEALGRLELWAPARQGSLTKGLAPRSGDLVYMEDSPSFCRPSKYSPGTAGRVCSREASCSSLCCGRGYDTQSRL 
VAFSCHCQVQWCCYVECQQCVQEELVYTCKH 



The disclosed NOV7 amino acid sequence has 205 of 330 amino acid residues (62%) 
identical to, and 252 of 330 amino acid residues (76%) similar to, the 354 amino acid residue 
ptnr:SWISSPROT-ACC:O42280 protein from Gallus gallus (Chicken) (WNT-14 Protein 
5 Precursor) (E = 1.3e~ 114 ). 

The tissue expression of NOV7 is predicted to be expressed in brain because of the 
expression pattern of (GENBANK-ID: gb: GEN B ANK-ID : AF03 1 168|acc:AF031 168.1) a 
closely related Gallus gallus Wnt-14 protein (Wnt-14) mRNA, complete cds homolog. 

NOV7 also has homology to the amino acid sequences shown in the BLASTP data 
10 listed in Table 7C. 



Table 7C. BLAST results for NOV7 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%> 


Positives 
(%) 


Expect 


gi | 16303264 | db j | BAB 
70499. 1| (AB063483) 


WNT14B [Homo 
sapiens) 


357 


330/331 
(99%) 


330/331 
(99%) 


e-175 


gi|391S306|sp|O4228 
0|WN14 CHICK 


WNT-14 PROTEIN 
PRECURSOR 


354 


204/332 
(61%) 


253/332 
(75%) 


e-109 


gi| 1508226l|ref |NP_ 
003386. 1| 
(NM_003395) 


wingless- type 
MMTV integration 
site family, 
member 14 (Homo 
sapiens] 


365 


209/335 
(62%) 


255/335 
(75%) 


e-108 


gi| 139748|sp|P10108 
| WNT1_XENLA 


WNT-1 PROTEIN 
PRECURSOR (XWNT- 
1) (XINT-1) 


371 


120/313 
(38%) 


175/313 
(55%) 


5e-58 


gi|302485l|sp|O1490 
5|WN15_HUMAN 


WNT-15 PROTEIN 


120 


120/120 
(100%) 


120/120 
(100%) 


2e-56 



The homology of these sequences is shown graphically in the ClustalW analysis shown 
in Table 7D. 



1 5 Table 7D Information for the ClustalW proteins 

1) NOV7 (SEQ ID NO: 20) 

2) gi|l6303264|dbj|BAB70499.l| (AB0634B3) WNT14B (Homo sapiens] (SEQ IDNO:62) 

3) gi|3915306|sp|O42280|WN14_CHICK WNT-14 PROTEIN PRECURSOR (SEQ ID NO:63) 

4) gi|l508226l|ref | NP_003386 . 1 | (NM_003395) wingless-type MMTV integration site 
20 family, member 14 (Homo sapiens] (SEQ ID NO:64) 

5) gi|l39748|sp|P10108|WNTl_XENLA WNT-1 PROTEIN PRECURSOR (XWNT-1) (XINT-1) (SEQ ID 
NO:65) 

6) gi|302485l|sp|O14905|WN15_HUMAN WNT-15 PROTEIN (SEQ ID NO:66) 
25 10 20 30 40 50 



NOV7 
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gi|l6303264 J 
gi|3915306| 
gij 15082261| 
gi 1 139748 I 
gi|302485l| 



MRPPPAIALAGL 

MALLRALLG- 

MLDGSPLARWLAAAFGI/ 
MRILTFLLGLI 




ASAG 



10 



15 



20 



25 



30 



35 



40 



N0V7 

gi|l6303264| 
gi|39l5306| 
gi| 1S082261| 
gi|l39748| 
gi|302485l| 



N0V7 

gi I 16303264J 
gi |3915306| 
gi|15082261| 
gi J 139748) 
gi|302485l| 



N0V7 

gi 
gi 
gi 
gi 



16303264 I 
3915306| 
1S082261| 
139748| 
3024851 1 



N0V7 

gi 1 16303264) 
gi|3915306| 
gij 15082261| 
gi|139748| 
gi|3024851| 



60 

• I — - - I 

gtaaa: 
gtaaa: 
tsemeeQ- 

T1>EP 

NVLPGSDARPVPLVLDP^ 



100 





120 

- -Smg 



130 



140 



150 



jPTGTGNyVFGl 




200 




250 



260 



270 



45 



50 



55 



60 



65 



N0V7 

gi I 16303264 I 

gi|3915306| 

gi|l508226l| 

gi|l39748| 

gi|3024851| 



N0V7 

gi 1 16303264) 
gi|3915306| 
gij 15082261| 
gi|139748| 
gi|3024851 I 



N0V7 

gi|l6303264| 
gi|3915306| 
gi|1508226l| 
gi|l39748| 
gij 3024851 1 




280 290 
' -gfflgl'TKG: 

-ksipghsdqi 
IpQgras^ggsdp] 

|pH--tflPENPTI 
340 



300 




350 
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VEAVSMSALECQFQFRFERWNCTLEGRyRASLLKRGFKETAFLYAISSAGLTHAXAKACSAGRMERCTCDE 
APDLENREGWKWGGCSEDIEFGGMVSREFADARENRPDARSAMNRHNNEAGRQVIKAGVETTCKCHGVSGS 
CTVRTWRQlAPFHE^GKHLKHKYESALKVGSTTNEAAGEAGAlSPPRGRASGAGGSDPLPRTPELVHliDD 
SPSFCIJVGRFSPGTAGRRCHREKNCESICCGRGHNTQSRWTRPCQCQVRWCCYVECRQCTQREEVYTCKG 



The disclosed NOV8 amino acid sequence has 270 of 354 amino acid residues (76%) 
identical to, and 310 of 354 amino acid residues (87%) similar to, the 354 amino acid residue 
ptnr:SWISSPROT-ACC;O42280 protein from Gallus gallus (Chicken) (WNT-14 Protein 
5 Precursor (1.2e* 151 ). 

NOV8 is expressed in at least brain. This information was derived by determining the 
tissue sources of the sequences that were included in the invention including but not limited to 
SeqCalling sources, Public EST sources, Literature sources, and/or RACE sources. 

In addition, the sequence is predicted to be expressed in brain because of the 
10 expression partem of (GENBANK-1D: gb:GENBANK-ID: AF031 1 68|acc:AF03 1 168.1) a 
closely related [Gallus gallus Wnt-14 protein (Wnt-14) mRNA, complete cds]. 

NOV8 also has homology to the amino acid sequence shown in the BLASTP data 
listed in Table 8C. 



Table 8C BLAST results for NOV8 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
<aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi|lS082261|ref |NP_0 
03386.1) (NM_003395) 


wingless-type 
MMTV integration 
site family, 
member 14 [Homo 
sapiens] 


365 


306/340 
(90%) 


321/340 
(94%) 


e-167 


gi| 3915306 | ep| 042280 
|WN14 CHICK 


WNT-14 PROTEIN 
PRECURSOR 


354 


270/357 
(75%) 


310/357 
(86%) 


e-142 


gi| 16303264 |dbj |BAB7 
0499. 1| (AB063483) 


WNT14B [Homo 
Bapiens) 


357 


193/339 
(56%) 


244/339 
(71%) 


e-ioo 


gi| 7106447|ref |NP_03 
3548.1) (NM_009522) 


wingless-related 
MMTV in teg ra t ion 
site 3A [Mus 
musculus] 


352 


141/311 
(45%) 


179/311 
(57%) 


2e-62 


gi| 5821261 |dbj | BAA83 
743.11 (AB0240BO) 


Wnt-3a [Gallus 
gallus) 


376 


139/311 
(44%) 


179/311 
(56%) 


3e-62 



15 

The homology of these sequences is shown graphically in the ClustalW analysis shown 
in Table 8D. 

Table 8D. Information for the ClustalW proteins 

1) NOV8 (SEQ ID NO: 22) 
20 2) gi| 15082261 |ref |NP_003386 . 1 | (NM_003395) wingless- type MMTV integration site 
family, member 14 [Homo sapiens) (SEQ ID NO: 64) 

3) gi|3915306|sp|O42280|WN14_CHICK WNT-14 PROTEIN PRECURSOR (SEQ IDN0:63) 

4) gi| 16303264 |dbj |BAB70499.l| (AB063483) WNT14B [Homo sapiens) (SEQ IDN0:62) 

5) gi|7i06447|ref |NP_033548 .1 j (NM_009522) wingless-related MMTV integration site 
25 3A [Mus musculus) (SEQ ID NO: 67) 

6) gi|582126l|dbj |BAA83743.1| (AB024080) Wnt-3a [Gallus gallus) (SEQ ID NO:68) 
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10 



15 



20 



25 



30 



35 



NOV8 

gi |15082261| 
gl|3915306| 
gi I 16303264| 
gi |7106447 j 
gi|5821261| 



NOV8 

gi|l508226l| 
gi|391S306| 
gi|16303264| 
gi j 7106447) 
gi j 5621261 1 



NOVB 

gij 15082261| 

gi|3915306| 

gi|16303264| 

gi|7106447| 

gi|582126l| 



NOV8 

gi| 15082261| 
gi|3915306| 
gi j 16303264 | 
gi |7106447| 
gi |582126l| 




20 30 
QVVCT -c£ 

Jaalr 

CTP 

LA LP 

KQA LG-S^ 

rBsQkqwgwcgwt pmg§aj» 




jEQRgsBSLW 




230 

.1 . . 



240 



250 
. . I 



40 



45 



50 



55 



60 



65 



70 



NOV8 

gl|lS08226l| 
gi|39lS306| 
gij 16303264) 
gi |7106447| 
gi j 5621261| 



KOV8 

gi 1 150B2261 | 

gi|391S306| 

gi|16303264| 

gi|7106447| 

gi|582126lj 



NOV8 

gij 1S082261| 
gi | 3915306( 
gij 16303264| 
gi|7106447| 
gi|5621261| 



NOV8 

gi| 15082261| 
gi|3915306| 
gi j 16303264 | 
gi|7106447| 
gi|5621261| 



I KgcI^TTCKCKGVSGSCllVRTClNflQL 

i k^R^ckchgvsgsciivrtcw2ql 
SkBg^SttckchgvsgscSvrtcwSol 

I ^RtJjJJCKCHG«2SGSCgvjjTC^ 



300 



290 

GRASGAGCSDpg 
3 RASGAGGS D P D 

-KSIPGHSDQa 
-Q-GSLTKGLA 

-Y TYPKg 

d-.y NFFKA 




350 
I 



96 



i,-jv lit* »i.tO» Histr 4 r Ttt-p •uii 1 "T. »n fin^ »-iki 5*.? i&w 



The disclosed NOV9 polypeptide has homology to the amino acid sequences shown in 
the BLASTP data listed in Table 9C. 



Tabic 9C BLAST results for NOV9 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


•ptnr:pir-id:A53791 


beta- adrenergic - 
receptor kinase 
(EC 2.7.1.126) 1 
- human 


689 


495/497 
(99%) 


495/497 
(99%) 


0.0 


ptnr:SWISSPROT- 
ACC:P25098 


Beta-adrenergic 
receptor kinase 1 
(EC 2.7.1.126) 


689 


494/497 
(99%) 


495/497 
(99%) 


0.0 


ptnr : SPTREMBL- 
ACC.Q99LL8 


SIMILAR TO 
ADRENERGIC, BETA, 
RECEPTOR KINASE 1 
- Mus musculus 


687 


490/495 
(98%) . 


493/495 
(99%) 


0.0 


ptnrrSWISSPROT- 
ACC:P26817 


Beta-adrenergic 
receptor kinase 1 


689 


489/497 
(98%) 


493/497 
(99%) 


0.0 


ptnr : SPTREMBL- 
ACC:Q99MK8 


G PROTEIN 
RECEPTOR KINASE 2 


689 


490/497 
(98%) 


494/497 
(99%) 


0.0 ! 



5 The homology between these and other sequences is shown graphically in the 

ClustalW analysis shown in Table 9D. In the ClustalW alignment of the NOV9 proteins, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence (i.e., regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
10 can potentially be altered to a much broader extent without altering protein structure or 
function. 



Table 9D. ClustalW Analysis of NOV9 

1) NOV9 (SEQ ID NO: 24) 
15 2) ptnr:pir-id:A53791 beta -adrenergic -receptor kinase (EC 2.7.1.126) 1 (SEQ ID 

NO:69) 

3) ptnr:SWISSPROT-ACC:P2 5098 Beta-adrenergic receptor kinase 1 (EC 2.7.1.126) 
(SEQ ID NO:70) 

4) ptnr : SPTREMBL-ACC:Q99LL8 SIMILAR TO ADRENERGIC, BETA. RECEPTOR KINASE 1 - Mus 
20 musculus (Mouse) (SEQ ID NO: 71) 

5) 6) ptnr:SWISSPROT-ACC:P26817 Beta-adrenergic receptor kinase 1 (EC 2.7.1.126) 
(Beta-ARK- 1) (SEQ ID NO:72) 

NOV 9 

25 A53791 
P25098 
Q99LL8 
P26817 

30 N0V9 

A53791 



MADLEAVLADVSYLMAMEKSKATPAARASKKI LLPEPS I RSVMQKYLEDRGEVTFEKI FS 
MADLEAVLADVSYLMAMEKSKATPAARASKKI LLPEPS I RSVMQKYLEDRGEVTFEKI FS 
MADLEAVLADVS YLMAMEKSKATPAARASKKI LLPEPS I RSVMQKYLEDRGEVTFEKI F£ 
JjDLEAVLADVSYLMAMSKSKATPAARASKKI LLPEPS I RSVMQKYLEDRGEVTFEKI F£ 
MADLEAVLADVSYLMAMEKSKATPAARASKKI LLPEPS I RSVMQKYLEDRGEVTFEKI FS 
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10 



15 



P25098 
Q99LL8 
P26817 

NOV 9 

A53791 

P25098 

Q99LL8 

P26817 

N0V9 

A53791 

P25098 

Q99LL8 

P26817 




IB 




shpfsksatehvqghlgkkqvppdlfqpyieeicqglrgdvfqkfiesdkftrfcqwki 
shpfsksatehvqghlgkkqvppdlfqpyieeicoSlrgdvfok?iesdkftrfcqwk»W 
shpfsksatehvqghlgkkqvppdlfqpyieeicqSlrgdvfqkfiesdkftrfcqwkSv 
s h p fs icSat e h vqghlEkkqv p p d l fq p y i e e i cqSl r g d v fq k f i es dk ftr f cqw kSv 

SHPFSK^TEHVQGHLQKKOVPPDLFQPYIEEICQ^LRGDVF^KFIESDKFTRFCQVJK^V 



SHPFSi^TEHVQGHLffiKKQVPPDLFQPYIEEICQSLRGDVFOKFIESDKFTRFCQW»J 



eujlhltkgdfsvhriigrggfgevygcrkadtgkmyamkcldkkrikt-ikqgetlaljer 
edjlkltksdfsvhriigrggfgevygcrkgdtgkmyamkcldkkrikmkqgetlalser 
edmihltmSdfsvhriigrggfgevygcrk3dtgkmyamkcldkkriki^kqgetlalSer 

ITT .StHT.TmSdF£\/HR.T TnRnnFCiFVYnrR KADTnKN5YAMKrijDKXR T KMKDnRTI.AI ,I?)PP 



IHLTMgDFSVHRirGRGGFGEVYGCRKADTGKMYAMKCLDKXRIKMKQGETLALfflER 



180 
180 
180 
178 
180 

240 
240 
240 
238 
240 



20 



NOV9 

A53791 

P25098 

Q99LL8 

P26817 



IMLSLVSTGDCPFIVCMSYAFHTPDKLSFILDLMgGGDLHYHLSQHGVFSEADMRFYAAE 

imlslvstgdcpfivcmsyafhtpdklsfildlmgggdlhyhlsqhgvfsead. v irfyaae 
imlslvstgdcpfivcmsyafhtpdklsfildlkSggdlhyhlsqhgvfseadmrfyaae 
imlslvstgdcpfivcmsyafktpdklsfildlmSgcdlhyhlsqhgvfseadmrfyaae 
tmlslvstgdcpfivcmsyafhtpdklsfildlmSggdlhyhlsqhgvfseadmrfyaae 



300 
300 
300 
298 
300 



25 



NOV9 

A53791 

P25098 

Q99LL8 

P26817 



I ILGLEHMHijRFWYRDLKPAjILLDEHGHVRISDLGLACDFSKKKPHASVGTHGYMAPE 
i ilglehmhSrfwyrdlkpaSi LLDEHGHVRI SDLGLACDFSKKKPHASVGTHGYMAPE 
iilglehmhSrfwyrdlkpaSilldehghvrtsdlglacdfskkkphasvgthgymape 

1 1 LGLEHMhSrFWYRDLKPaSi LLDEHGHVRI SDLGLACDFSKK^PHASVGTHGYMAPE 

iilglehmhSrfwyrdlkpaiSilldehghvrisdlglacdfskkkphasvgthgymape 



360 
360 
360 
358 
360 



30 



35 



40 



45 



NOV 9 

AS3791 

P25098 

Q99LL8 

P26817 

NOV9 

A53791 

P25098 

Q99LL8 

P26817 

NOV9 

A53791 

P25098 

Q99LL8 

P26817 



LQKGVAYDSSADWFSLGCMLFKIiLRGHSPFRQHKTKDKHEIDRMTLTI4AVELPDSFSPE 
VLQKGVAYDSSADWFSLGCMLFKLLRGHSPFRQHKTKDKHEIDRMTLTt-lAVELPDSFSPE 

LQKGVAYDSSADWFSLGCMLFKLLRGHSPFRQHKTKDKHEIDRMTLTMAVELPDSFSPE 
"XQKGVAYDSSADWFSLGCNLFKLLRGHSPFRQHKTKDKHEIDRMTLTMAVELPDSFSPE 
VLQKGVAYDSSADWFSLGCMLFKLLRGHSPFRQHKTKDKHEIDRMTLTMAVELPDSFSPE 



lrsllegblqrdv»grrlgclgrgaqewvespffrsldwqmvflqkyppplipprgevigaa 
lJsllegllordvJrrlgclgrgaqevkespffrsldwqwflqkyppplipprgevSaa 

LfJsLLEGLLQRDvRRRLGCLGRGAQEVKESPFFRSLDWQrWFLQ^YPPPLIPPRCEvSAA 
LRSLLEGLLQRD^jRRLGCLGRGAQEVKESPFFRSLDWQrWFLOKYPPPLIPPRGEVijAA 
LRSLLEGLLORDvSRRLGCLGRGAQEBKESPFFRSLDWQrWFLQKYPPPLIPPRGEvSAA 



DAFDIGSFDEEDTKGIK^^^HJgjgBSagj^l^^^BQEVAETVFDTIijAETDRLEARK 

dafdigsfdeedtkgiklldsdqelyrgfpltiserwqqevaetvfdtlsaetdrleark 
dafdigsfdeedtkgiklldsdoelyrSfpltiserwooevaetvfdtiSaetdrleark 
dafdigsfdeedtkgiklldsdqelyrSfpltiserwqqevaetvfdtiSaetdrleark 
dafdtgsfdeedtkgiklldsdqelyrSffltiserwooevaetvfdtiSaetdrleark 



420 
420 
420 
418 
420 

480 
480 
480 
478 
480 

519 
540 
540 
538 
540 
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NOV 9 

A53791 

P25098 

Q99LL8 

F26817 



kakgkqlgheedyalgkdcimhgymskmq^pfltqwqrryfylfpgrlewrgegeaposl 
kakSkqlgheedyalgkdcimhgymskmoSpfltqwqrryfylffSrlewrgegeapqsl 
kakSkqlghsedyalgkdcimhgymskmgJpfltqwqrryfylfpSrlewrgegeapqsl 
kakSkqlghsedyalgkdciEhgymskmgJpfltqwqrryfylfpSrlewrgegeapqsl 
kak*kolgheedyalgkdcimhgymskmg5pfltowqrryfylfp"rlev/rge!3eapqsl 



579 
600 
600 
598 
600 



55 



NOV9 

A53791 

P25098 

Q99LL8 

P26817 



ltmeeiqsveetqikerkclllkirggkqfilqcdsdpelvqwkkelrdayreaqqlvqr 
ltmeeiqsveetqikerkclllkirggkqfilqcdsdpelvqwkkelrdayreaqqlvqr 
ltmeeiqsveetqikerkclllkirggkqfilqcdsdpelvowkkelrdayreaoqlvqr 
ltheeiosveetoikerkclllkirggkqfKlqcdsdpelvqwkkelrdayreaqqlvqr 
ltmeeiqsveetqikerkcli.lktrggkqfSlqcdsdpelvqwkkelrdayreaqolvqr 



639 
660 
660 
658 
660 



60 



NOV9 

A53791 

P25098 

Q99LL8 

P26817 



vpkmkjkprspwelskvplvqrgsaggl 
vpkmkSkprspwelskvplvqrgsaSgl 
pkmkSkprspwelskvplvqrgsaSgl 
vpxmk^kprspwelskvplnqrgsaggl 
vpkmkSkprspwelskvploqrgsaSgl 



668 
689 
689 
687 
689 



65 Tables 9E-9L list the domain descriptions from DOMAIN analysis results against 

NOV9. This indicates that the NOV9 sequence has properties similar to those of other 
proteins known to contain this domain. 
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Table IOC. BLAST results for NOV10 






(aa) 


I dent i ty 
(%) 


Positives 
(%) 


Expect 


ptnr : SPTREMBL- 


ALPHA MANNOSIDASE 
sapiens 


1062 


763/771 

/ QQH 


767/771 
(99%) 


0.0 


ptnr : SPTREMBL- 
ACC:Q9NTJ4 


HYPOTHETICAL 115.8 
KDA PROTEIN - 
Homo sapiens 


1040 


715/722 
(99%) 


718/722 
(99%) 


0.0 


ptnr : TREMBLNEW- 
ACC:AAH16253 


SIMILAR TO 
MANNOSIDASE, 
ALPHA, CLASS 2C, 
MEMBER 1 


1039 


635/730 
(89%) , 


692/730 
(94%) 


0.0 


ptnr:SWISSPROT- 
ACC:P21139 


Alpha -mannosidase 
(EC 3.2.1.24) 


1040 


625/731 
(85%) 


661/731 
(90%) 


0.0 


ptnr : SPTREMBL - 
ACC:Q13358 


ALPHA - MANNOS IDASE 
- Homo sapiens 


425 


425/425 
(100%) 


425/425 
(100%) 


0.0 



The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 10D. In the ClustalW alignment of the NOV10 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence (i.e, regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 



Table 10D. ClustalW Analysis of NOV10 

1) NOV10 (SEQ ID NO: 26) 

2) ptnr: ALPHA MANNOSIDASE 6A8B - Homo sapiens (SEQ ID NO: 73) 

3) ptnr: HYPOTHETICAL 115.8 KDA PROTEIN - Homo sapiens (SEQ ID NO: 74) 

4) ptnr: SIMILAR TO MANNOSIDASE, ALPHA, CLASS 2C, MEMBER 1 (SEQ ID NO: 75) 

5) ptnr: Alpha-mannosidase (EC 3.2.1.24) (SEQ ID N0:76) 



NOV10 

Q9UL64 

Q9WTJ4 

AAH162S3 

P21139 

KOV10 

Q9UL64 

Q9WTJ4 

AAH16253 

P21139 



MAAAPr LKh>. 
MAAAPFLKKK 
MAAAPgLKHK 
MAAAPFLKHk 
MAAAPFLKHW 




N0V10 
Q9UL64 



AQO LF.WVKSRY PGLYSR IQE FACRGQFV P7GGT W V EM DGi'L P S G E A>»V F.Q F L<QGC5 F F LQ E r G KHCS EFWLf 1 
? I F ACSOACO LEMV KSHY IK: LYSR I OE FACRGOFV PVGGTV.'VE.M DC?L PSCEA.MVRO F LOGOT'F FLOE FG KHCS EFHL F 



272 
371 



117 



f = = = -r? Tr 5.5 ~u "Zf «^ -5 r = 1^ 

«LaT *!.«!• 1** ^ «fc.tf tud* R ' Ul W «„tF ifc,J» (hu« 



5 

10 
15 

20 
25 
30 
35 
40 
45 
50 
55 



Q9MTJ4 

AAH16253 

P311J9 



J F I r AC S QAQQ LEWV KS R Y PG L Y S R I QE F A C RG 0 FV PVCGTWV END&T LPSG EAMVRQF LQGC-T K K LO E FC KMC S E r W LI 



F ACS O A 001 -EWV KSAYPGL 



JsffOE FACRGQFV PVGGTWV 



SGEAMVROFLOGC 



- TLCEgroggroAABH - 



371 
370 
261 




HOV10 
Q9UL64 
Q9NTJ4 

AAH16253 TN 
P31139 



».^HSAFLFCFCOGGGG PTQTKLDRLKRLS^rt5GLPRVQL.S5PRC l '" r *SALESDSEOLCTWGELFLE LHijGTYTTHAQl KK 
A«jHSAFLFCFCDGCGGPT0TMLDItLKRL£jrDGLPRV0LS5PRCLFSALESDSE0LCTWVCELFLELH^TYTTHA0I KF 
&"hSAFLFCFCIXXCGPTO™LDRL:<RLS^[XSLPRVQLS3PRC^ 
HSSFLFGFGDCCXJGPTOTTUDRLKRLS^DGnPRVOLSSP^ 



LPRVOLS 5 PROLES ALES DSEOLCTWVCELFLELHjj 



432 

531 
531 
530 
*Vjg£jg 325 




NOV10 
Q9UL64 
Q9NTJ4 
AAH16253 
P21139 

HOV10 
Q9ULS4 
Q9KTJ4 
AAH1C253 
P21139 

NOV10 

Q90L64 

Q9NTJ4 

AAH16253 

P21139 



AURWMDLSEHGFGLALL/jDCKYGASVRGS I LSLSLLRAPKAPDATAOTGRHEFTYALMFHKGSFQDAGV lQAAYSLjFPL 
RWMDLSEHGFGU^LL<jDCKYGASVRGSILSLSLLRAPKA?DATAI5TGRHEFTYAXMFHKGSFODAGViOAAYSL<jFPL 
AHRWMDLSEHGFGLAL1>?DCKYGASVRGS I LSLSLLRAPKAPDATADTGRHEFTYALMPHKGSFQDAGV IQAAYSIjSfPL 
AHRV>nDLSFf^FGI^Ll^T'CKYGASV3G?^LSLSLLRAPKA 3 DATADTGRHEFTYALMPHKGS?0"AGV ]<?AAYi?il »TiFPl 



832 
931 
931 
930 
408 




60 Table 10E lists the domain description from DOMAIN analysis results against 

NOV 10. This indicates that the NOV10 sequence has properties similar to those of other 
proteins known to contain this domain. 



Table 10E. Domain Analysis of NOV10 



Model 



Description 



Glyco_hydro_38 (Inter Pro) Glycosyl hydrolases family 38 
(SEQ ID NOtTll) 



Score 
140.5 



E-value 
le-39 



Glyco_hydro_38: domain 1 of 2, from 230 to 332: score 89.2, E = 5.4e-25 
*- >vtGGWVMnDEAttHyedlIdQlteGHqf LeenfGsdvkPkvgWsIDP 

|+|+||+ I 4- +-4- + I + I I 4- I + +•+ I J + +|+4-| + 

AC058790_d 230 VGGTWEMDGNLPSGEAMVRQFLQGQNFFLQEFG- -KMCSEFWLPDT 274 



AC058790 d 



FGHSatmPyLlraqaGfdgfllqRihYadKksfaetkqleFvWRqswslt 
I 1*1 4 |+ *| |+| ♦♦♦♦♦♦ +| | + | | + 

275 FG YSAQLPQI M- HGCGI RRFLTQKLS WNLVN S FPHHT FFWE GLD 317 



118 



:; ? h ST* i-r ~T ^tvt, 3 & a ZZ- 

tJ'rt'lmP •hc!»*T:i.P ir 'h^P »wP ~*G* a t«V «rd« n%.'¥ • *ttat 'hn? U-ji 



functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 



10 



Table 11F. ClustalW Analysis of NOV11 

1) NOVlla (SEQ ID NO:28) 

2) NOVllb (SEQ ID NO:30) 

3) NOVllc (SEQ ID N0:218) 

4) ptnr: Clq-related factor precursor - Homo Bapiene (SEQ ID NO: 77) 

5) ptnr: Clq-related factor precursor - Mus musculus (SEQ ID NO: 78) 

6) ptnr: Gliacolin precursor - Mus musculus (SEQ ID NO: 79) 

7) ptnr: Complement Clq subcomponent (SEQ ID NO: 80) 



15 



20 



25 



30 



35 



40 



45 



NOVlla 
NOVllb 
NOVllc 
07 597 3 
088992 
Q9KSN4 
P02746 
AAH06983 



NOVlla 
NOVllb 
NOVllc 
075973 
OB8992 
Q9ESN4 
P02746 
AAH08983 



NOVlla 
NOVllb 
NOVllc 
075973 
088992 
Q9ESN4 
P02746 
AAH08983 




50 



55 



NOVlla 
NOVllb 
NOVllc 
075973 
088992 
Q9ESN4 
P02746 
AAH08983 




240 
-.1 



NGQVRAS 


183 


NGQVRAS 


183 


NGQVRAS 


232 


NGQVRAS 


203 


NGQVRAS 


203 


NGQVRAS 


203 


--AQHVV 


193 


--AQ§W 


195 



60 



NOVlla 
NOVllb 
NOVllc 
075973 
088992 



250 

..I.. 



260 



270 



280 

J.. 



290 



I AQDADQN YDYASNS VI LHLD jGDSVFI KLDGGKffiHGGNAN KYSTKSGFI I Y PC 

aiaqdadqnydyasnsvilhlcBgdevfikldggkBhggn«nkystfsgfiiypc 

ArAQDADQWYDYASNSV2LHLCgGDETV^KLDGGKgHGGNSNKYSTFSG?i2J YPi: 
AIAQDADQNYDYASNSVILHLCgGDEVFIKLDGGKgHGGNgNKYSTFSGFIIYgC 
AI AQDADQN YDYASNS VI LHLdBgDEVFI KLDGGKHHGGNgNKYSTFSGr 1 1 YgC 



238 
- 238 
-- 287 
-- 258 
-- 258 



127 



w!h\ n-rr t»it» ImP ,r •h.c icf • -a tn h-«p .™p ,n5n 
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Tables 1 1E-1 IF list the domain descriptions from DOMAIN analysis results against 
NOV1 1 . This indicates that the NOV1 1 sequence has properties similar to those of other 
proteins known to contain this domain. 



15 



20 



Table HE. Domain Analysis of NOV11 

gnl 1 Smart | smartOOllO , C1Q, Complement component Clq domain.; Globular 
domain found in many collagens and eponymously in complement Clq. When 
part of full length proteins these domains form a 'bouquet 1 due to the 
multimerization of heterotrimers . The Clq fold is similar to that of 
tumour necrosis factor. (SEQ ID NO: 104) 
CD-Length - 132 residues, 99.2% aligned 
Score b H3 bits (283), Expect » le-26 



Query : 


108 


Sbjct : 


2 


Query: 


166 


Sbjct: 


60 


Query: 


225 


Sbjct: 


119 



165 



PRIAFYAGL - -RJIPHEGYEVLRFDDWTNVGNAYEAASGKFTCPMPGVYFFAYHVLMRGG 

II II II ♦♦III I* I I* ♦+IIIIIMIIM*II* ♦ 

PRSAFSVIRSTNRPPPPGQPVRFDKVLYNQQGHYDPSTGKFTCPVPGVYYFSYHIESK-- 59 



DGTSMWADLMKNGQVRAS A I AQDADQNYDYASNSVI LHLDVGDEVFI KLDGGKVHG -GNT 

I Mill ♦ I II ♦! I IM^II I 

♦GRNVKVSLMKNGIQVMRECDEYQKGLYQVASGGALLQLRQGDQVWLELDDKKNGLYAGE 



111111*^11 



224 



118 



25 



Table 11F. Domain Analysis of NOV11 

gnl | Pfam|pfam003 86 , Clq, Clq domain. Clq is a subunit of the CI enzyme 
complex that activates the serum complement system. (SEQ ID NO: 112) 
CD-Length = 125 residues, 100.0% aligned 
Score « 102 bits (253) , Expect o 3e-23 



30 



35 



Query: 


111 


Sbjct: 


1 


Query: 


170 


Sbjct : 


58 


Query: 


228 


Sbjct: 


118 



AF YAGLR - RPHEGYEVLRFDD VVTNVGNAYEAASGKFTCPMPGVYFFA YHVLMRGGDGTS 169 

III II + + IM* I I* MIIIIMI + M +1 1 ♦ 11 + 

AFTAIRSTRPPAPGQPVIFDEVLYNQQGHYDPATGKFTCPVPGLYYFNFHVSSK- - -GTN 57 

MWADLMKNGQVRASAIAQDADQNYDYASNSVILHLDVGDEVFIKLDGGKVHG- -GNTNKY 227 

♦ 11*11 I ♦ I III +1 I II l+*+ll ♦ ♦! I 

VCVSLMRNGVPVMSFCDEYAKGTYQVASGGAVLQLRQGDRVWLELDDKQTNGLLGGEGVH 117 



I IIII++ 



The first component of complement system is a calcium-dependent complex of the 3 
40 subcomponents C 1 q, C 1 r, and C 1 s. Subcomponent C 1 q binds to immunoglobulin complexes 



128 



n<B« Rwf n*r r t---" r 



W~*tr !? n«T niV ttwtf 



ptnr :SPTREMBL- 


PLEXIN 3 - Mus 


1872 


1245/1874 


1478/1874 


0.0 


ACC:P70208 


musculus 




(66%) 


(78%) 





The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 12D. In the ClustalW alignment of the NOV 12 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
5 regions of conserved sequence (i.e., regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 



10 

15 

20 
25 
30 
35 
40 
45 
50 
55 
60 
65 



i) 

2) 
3) 
4) 
5) 
6) 



Table 12D. ClustalW Analysis of NOV12 

NOV12 (SEQ ID NO: 32) 

ptnr: PLEXIN 1 - Mus musculus (SEQ ID NO: 81) 
ptnr: NOV/ PLEXIN- Al PROTEIN - Homo sapiens (SEQ ID NO: 82) 
ptnr: PLEXIN PRECURSOR - Xenopus laevi3 (SEQ ID NO: 83) 
ptnr: Plexin A3 precursor (Plexin 4) (SEQ ID NO: 84) 



mmltpaopbhrgprpqpampBpprslqvlll] 

0PLPPLSSRTL1 




.rahvtgpved 
-rahvtcpved 
-rahvtcpved 
.fquvtcpved' 
.rahvtcpved 



LDDLFKLGEPHHRKEHYLSSVOEAGSf J lAQVLl AGPPGQGOAKLFVGTP J DGKSEVFPTL.SS RRLMAjEEDADMFGFVYQr. 
LDDLFKLGEPHHRKEHY LSSVi^EAGSMACVLl AGPPCOCOAKLFVGTPIOGKSEVFPTLSSRR1J^?EECA2>1FGFVYOC 
LnSLFKLGEPHHRKEtiYLSSVQEftG SMAGVLlACPPGQCOAKLFVGTPrD G KSEYFPTLSSRR L f-Lv'E EOAgMFGFVYOL- 

lddlfklgefhhrkehylssv«2e3g 

LDDLFKLCE PHHHKEHY LS22jES£5MAGV»(| 



efvssqlki psdtlskfpafdryyvysfrse0fvyyl.tl0ldtolt5pdaacehfftsklvrlcvgdpkfy3vvefhlct 
sfvssqlkipsdtlskfpafdiyyvysfrseofvyyltloldtoltspdaagehfftskivrlc\''Jd?kfysyvefpigc 
f.fvssc-i'ki psrn'lskfpafdtyyvysfrseofv'yyltr^ldtoltspoaagehfftskrvrlcvisopkfysyvefplgc 



vovxa 

P70206 

Qsuxwa 

Q91823 flLLHABRPLP: 

P51805 

NOV12 
P70306 

Q91823 
PS1805 

NOVli 
P70206 
Q9UXW2 
Q91623 
P51B05 

NOV12 
P70206 
Q9UIW3 
091823 
P51B05 

NOV13 
P70206 
Q90IW2 
091823 
P51B05 

HOVIJ 
P70206 
Q9UIV2 
091823 
FS1B05 

HOV12 
P70206 
Q9UIW3 
091823 
PS1B05 

NOV J 2 
P70206 
Q90IW2 
Q91B23 
PSIB05 

HOV12 
P70206 




EOAGVEYRLVOCAY LSR PC^ALA^iOLCLAEDEDVLFTVFA0G0^]iRVKPPKESALCLFTLRAI KEKI KEK lOiJCYRGEGh 
EQAGVEYRLVQDAYLSRPcJaI^QOLGLAEDEi^LFTVFA^O^'IRVKPPKESALCLFTLRAI K£K^ KERJ OSCYRGEGr 
SOAGVEYRLVO^AYLSRPCiALAloLGLAEDEDVLFTVFAOCOKijRVKPPKESALCLFTLRArKEKIKERIOSCYRCEGK 



240 
220 
200 
260 
202 

320 
300 
280 
260 
262 

400 
380 
360 
160 
362 



em 






SSitjMEfefflljffiTffi?^^ 


M 




^Tgrpalave 
5:;rpalay£ 

Stmgggmm 


SVVAQEGSPl LKOLVLSH 
SWAQECjPI LRULVLSn 
SWAQECSPILRDLVLSFj 


iHQYLY A v t7E KQ VI v[v P V F.S C VC. Y'l S CK LC IjG S R D PK CG WC V LH S' 

JjIoylyamtekovtTvpvescvoytscelclgsrdpmcgwct/i.hs 
jhoylyayteko'/'r^vpvescvcytscelclgshcphcgwcvlus 


r 

ICS 



460 
440 
160 
442 

560 
54 0 
530 
160 

@G Fpr^iJ^^Pvv^niWTI^FHaDrafiHiT^^ 518 

639 
61B 
59B 
160 
597 

719 
695 



FgDACERAVEPQRFAjDLLQCVOLTVOPFjVSVTMSiQV^ 
pSdACERa2ePORFa5dLI^CVOLTV0PrJv3VTM3OVF 
KgDACER A? E POH r aJJu I .WCVQ LTVO PFlJvS V7MSQV T 


i L V LQAVC j.V PD LSAGVjj CS F E D FT EgES V L E 

lvloauJvpdlsagvjcsfedftcijesQle 
i ,v rvOAw?>v ?nr.s anvijcs f ec fte?f r.c 


[DCRIHCgSr 

joGRiKojsr 









3DQRVVKL.Y L K.S KSTC K K F AS V DF V F 
3DORW KLYLK3 KETG KKFASVDFVF 



135 



iuOw Oar ^m? tu.F »r «h-tf "C IS "w? ov..F «hu!> u«5u «I,»P tK«n 



H^MiMji^ity --iana3PmHK ii^ 



«75 
160 



5 

10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 
90 



HOV12 
P70206 



SRvgJSEDCPOlLPS^JlYVPVCWKPITLAAFTiLP^ 

"RVj^SEDCPQILPSTgr.-VPVGVVKPITLAApSLPQPOSGOPCYECLFHIPgSPARVTALRFSsSSLQCc'sSYSYEG 1 



« 799 
J 775 
4 755 
160 
754 



DVS DLPV JUS VVfc^jF V I DjPOjl OAH LY KCPA^ 
DVSDLPVJLSyVwToTiFV I [>JpC"J^ ^ 

DVS DLPvSlS VVfcSdSFV I dSpcS 1 OAliLY KCPALR^SCGLCLKADPRFECCWCVAE RRCS LRHHOTAEmPASHKHaRhC 



B79 
655 
835 

- 160 



SSRCTDPKl LKLSPETGPRQGGTRLTlTCEgUJLRFEDVRIXVgVGKVLCSPVESEY ISAEQ1 VCE IGDASSJiRAHDALV 
SSRCTOPK I LKLSPKTCPROGCTRLTITCE'JlCLRFEDVRLCvSvGKVLCSPVESEY I SAEOJ VCEIGDAsjgRAHDALV 
SSRCTnFKTLKt.SPETG?RQGCTRLTITCDS^CLRFEDVRLGv3vG)CVLCSPVESFYISftF01VC5IGDAs55RAHDALV 



959 
93S 

915 

" - 160 

GM^HgRllr^H^V^Xfl^aSvflav^jfgnLS RfeflGBRlSAq - - ERMsgp ACTre?3S R lI7gaE ga LV PS P P PO pfl 911 



bVCVRi^£gHYRALSPKRrTFVTPTFYRVSPSRGPLSGGTWiCIEGSHl.'TiAGSDVAVSVGGKPCSFSK 
EVO/RDCSgHYRALSPKRFTFVTPTrYRVSPSRGPLSGGTWIGl EGSHIjJJaGSOVAVSQgGRPCSFSV. 
EVCVRDC£™HYRALS PKRF TFVTPTFYRVS PSRGPLSCCTW IC 1 ECSHl.?jACSOVAVSVGGR PCS PS V: 



1039 
1013 
993 
160 



eSq|1vr--Bd£E a 0 v GH 989 



PGSAP1 MliTi JJRAQLj 
PGSAPiSlSlijRAOL 
PGSAPl3l"™ltSRAQL< 



I P E V K Yj JY TE DPT i LR 1 DPEWS 1'jS GGT L LT VTCT Jl-AT V RE PR I RAKYGGI ERE 
LggPEVKYtBYTEDPT I LR I DPEWS 2 'JiSGGTLLTVTCT'JlATVRE PRI RAKYGG I ERE 1 
P E V KYa V T E D PT I L R I D PE W S I 'JsCGT L LTVTGtSl ATV R E P R [ RA K Y GG I E R E 



[^tlgSso32iSaOdQ^<S1sQg3i 



1117 
WSH 1092 
tfGH 1072 
160 
1069 



Q9UXW2 
Q91B23 
P51805 

K0V12 
P70206 
Q9UZW2 
091823 
P5180S 

NOV12 
P70206 
Q9UIW2 
Q91823 
PS1805 

N0V12 
P7020C 
Q9UIW2 
Q91823 
P5180S 

NOV12 
P70206 
Q9UZW2 
Q91B23 
PS1805 

NOV 12 
P70206 
Q9UIW2 
Q91823 
P51805 

NOV 12 
P70206 
Q9UXM2 
091823 
P51B05 

NOV12 
P70206 
Q9UZW2 
091023 
P51805 

NOV12 
P7020C 
Q9UXW2 
Q91823 
P51805 

NOV12 
P70206 
Q9UIW2 
091823 
P51805 

NOV12 
P70206 
Q9UIW2 
Q91823 
P51605 

Movxa 

P70206 
Q9UXW2 
091823 
P51805 

NOV12 
P70206 
09UZW2 
091833 
P51805 

P70206 ifflg.^ r. r .n r Bgiggffl-BBB^ ^ 17 io 

Q9uiw2 ffikTOW.]!«.MagH8^^ 1710 

Q91«3 ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ ^ 160 
091B23 ^f^^^ 160^ 




LLL.LV 1 VAVL I AYlvRKSRDADRTLKRLOt^HDjLESRVALECKEAFASLCTD I HELUjDLDGAG I PFLDYRTYA/IRVLFF 
LLLLV I VAVL I AYKHKSRDADRTLKRUOLOHdSlESRVALECKEAFASLOTD I HELlSoLOGAG I PFLDYRTYAMRVLFF 
AA.l.V 1 VAVLT AYKPKSRDADRTLKRLQLQMI>"lESRVALECKEAFAELCTD I HELl 



gDLDGAG I PFLDYRTYAMRVLFF 
™0 L DGAC I P F LDY RT Y AM R V L P F 



IEDHPVLKEMEVOAjj|VEKS LTLFCOLLTKKHFLLTF 1 KTLEAORSFSMRDRCjVASL 1MTALOGEMEYATGVLKQLLSC 
IEDHPVLKEMEVOAWEKSLTLFGQuLTKKHFLLTFIRTLSAORSFSMRDROWASLIMTALOGEHEYATCVLKQLLSC 



IKPH PVLKEMEVQ/X^VEKSLTLFGOLLTKKNFLLTK 1 RTLEAQRSFSMRDR' 



'C-jVASLI 
IC-SvASL! 



MTALCGEMEYATCVLKQLLSC 
MTALOGEHEYATGVLKOLLSC 



TESVAEKMLPjWFTFLLYKFI.KECAGEPLFMLYCAIKCOMEKGPIDAlTGEARYSLSEDia 
TESVAEW^LT-jwFTFLLYKFLKECAGEPLFMLYCAIKCOMEKGPIDAITUEARYSLSEDKL 
TESVAEKMLTjwFTFI.I.YKFLKSCAGEPUFMLYCA T KCOMEKGPIDA ITGEARYSLSEDKI. 



1357 

1332 

1312 

160 

1309 

1437 

1412 

1392 

160 

1389 

1517 
1491 
1471 
160 



WPVFSbjfcl&lrOAhn KLULiHykGMPYSQRPKAPDMDLEKROGRXuR 1 I LOOBDV 



1597 

1571 

1551 

160 

1548 




1917 
1886 



».Un «.-r iwr <r n..i!» -n m w w..p ^.o« «i!v» ■»!*■ n»«n 



10 



- X7S4 

091823 - HI 4 

P5180S m-EBRgES^aidrlilra^^ -miCTS^^c^t<m3 cCTy?a Ti i8 63 

N0V13 ffDTfgLSa 1925 
P70206 gDlfgLSg 1894 

Q9UIW2 175« 

Q92823 160 

P51605 gSl^SCfl 1871 

Tables 12E-12N list the domain descriptions from DOMAIN analysis results against 
NOV12. This indicates that the NOV 12 sequence has properties similar to those of other 
proteins known to contain this domain. 



15 



Table 12E. Domain Analysis of NOV12 



gnl | Smart: } Bmart00630 , Sema, seroaphorin domain 
CD-Length = 430 residues, 100.0% aligned 
Score * 242 bits (618), Expect » le-64 



(SEQ ID NO : 113 ) 



20 



25 



30 



35 



40 



45 



Query : 69 LTHLVVHEQTGEVYVGAVNRIYKLSGNLTLLRAHVTGPVEDNEKCYPPPSVQSCPHGLGS 128 

I ♦!♦* I I -llll 11*1 II II Mil + | | 

SbjCt: 1 LQNLLLDEDNGTLYVGARKRLYVLSLNLISEAEVKTGPVLSSPDCEEC- -VSKGKDPP- - 56 

Query : 129 TDNVNK-LLLLDYAANRLLACGS -ASQGICQFLRLDDLFKLGEPHHRKEHYLSSVQEAGS 186 

II II Hill I- II II* I I +1* * I *l *l I 

SbjCt : S? TDCVNFIRLLLDYNADHLLVCGTNAFQPVCRLINLGNLDRL-EVGRESGRGRCPFDPQHN 115 

Query : 187 MAGVLI AGPPGQGQAKLFVGTP I D - -GKSEYFPTLSSRRLMANEEDADMFGFVYQDEFVS 244 

II- I -1*111 || ||* | 
Sbjct : 116 STAVLVDG-- ELYVGTVADFSGSDPAI YRSLSVRRLKGTSG PSLRTVL 161 

Query : 245 SQLKIPSDTLSKFPAFDIYYVYSFRSEQFVYYLTLQLDTQLTSPDAAGEHFFTSKIVRLC 304 

+ * * + IM I Ml* * | |++ |*| 

Sb3Ct: 162 YDSRWLN EPNFVYAFESGDFVYF FFRETAVEDENCGKAWSRVARVC 208 

Query: 305 VDD PKFYSYVEFPIGC EQAGVE YRLVQDA YL SR PGRALAHQLGLAEDED 353 

*l I* I*** * I * * +| |*| | + | + 1 

Sb}Ct : 209 KNDVGGPRSLSKKWTSFLKARLECSVPGEFPFYFNELQAAFLLPAG SESDD 259 

Query : 354 VLFTVFAQGQKNRVKPPKESALCLFTLRAIKEKIKERIQSCYRGEGKLSL PWLLNK 409 

H + II* I 11*1 1*1 I |*|| + 

SbjCt: 260 VL YGVFSTS SNPI PGSAVCAFSLSDINAVFNEPFKECBTGNSQWLPYPRGLVPFPR 315 

Query: 410 ELGCINSPLQI DDDFC-GQDFNQPLGGTVTIEGTPLFV- -DKDDGLTAVAA Y 4 58 

I 1*11 II + | MM | + ||++| 

SbjCt : 316 PGTC PNT PL SSKDL P DD VLN F I KTHPIiMDEWQ PLTGRPLFVKTDSNYLLTS I AVDR VRT 375 

Query : 459 DYRGRTWFAGTRSGRIRKILVDLSNPGGRPALAYESWAQEGSPILRDLVLSPNH 514 

i 11*1 ii iii i*** i* ♦ i i in* nun 

Sbuct : 376 DGGNYTVLFLGTSDGRILKWLSRSSSSSESWLEEISVFDPGSPV-SDLVLSPKK 430 
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